Louis Leon Thurstone 


With the death of L. L. Thurstone on September 29, 1955, psychology 
lost one of its greatest, a unique figure on the psychological scene and one to 
whom psychologists will always be indebted. If any psychologist of the past 
quarter century deserved to be called Mr. Psychological Measurement, it was 
he. His major professional objective coincided with that of the Psychometric 
Society and of Psychometrika, both of which were founded under his leadership: 
The development of psychology as a quantitative, rational science. By virtue 
of his own contributions and his influence on others, psychology has taken 
long steps in the direction of fulfillment of this objective. No major aspect of 
the field of measurement was untouched by him. 

Louis Leon Thurstone was born May 29, 1887, in Chicago, where in later 
years he spent the greater portion of his professional life and achieved his 
greatest distinction, at the University of Chicago. His parents were of native
Swedish stock, his father’s occupations being, in turn, military instructor, 
Lutheran pastor, editor, and publisher. Owing to a mobile family life, Thur- 
stone went to school in Illinois; Mississippi; Stockholm, Sweden; and James- 
town, New York. He attended Cornell University, where he specialized in 
engineering. Considering the few instances of which the writer has known in 
which psychologists have started from a base of engineering training, he has 
often thought that we should be better off if more psychologists had taken 
that educational route. 

It was during his engineering-school days that the problem of the learning 
curve, and hence psychology, caught Thurstone’s attention. On graduation, 
however, he was offered a position in the laboratory of Thomas A. Edison, 
where he spent the year of 1912. During the next two academic years, he 
taught engineering courses at the University of Minnesota, and there began 
his study of experimental psychology. Graduate work followed at the Uni- 
versity of Chicago. In 1915 he accepted an assistantship in the new and active 
laboratory established by Walter V. Bingham at the Carnegie Institute of 
Technology. He received his doctorate from Chicago, with a dissertation on 
the learning curve. His academic rise at Carnegie was something of a record. 
Beginning with the rank of instructor in 1917, with a promotion each year he 
became professor and head of the department by 1920. 

The year of 1923-24 was spent in Washington, D. C., with the Institute 
for Government Research, an agency devoted to the improvement of civil- 
service practices. From that time on, Thurstone had considerable influence, 
directly or indirectly, upon civil-service procedures. 

After his marriage in the summer of 1924 to Thelma Gwinn, Thurstone 
assumed his professorship at the University of Chicago. In the course of time, 
he had much to do with initiating and setting the pattern for the University’s 
distinguished Board of Examinations. In 1938 he was honored with the 
appointment as Charles F. Grey Distinguished Service Professor. In 1948 he 
was Visiting Professor at the University of Frankfurt, and in the spring 
semester of 1953 at the University of Stockholm. He retired from Chicago in 
1952, at which time he became Research Professor and Director of the Psy- 
chometric Laboratory at the University of North Carolina, which was his 
professional affiliation at the time of his death. 

His unquestioned creative productivity can perhaps be attributed to 
certain traits that seem to stand out—his dissatisfaction with the status of 
psychology as he found it, his keen analytical ability, and his independence 
and originality of thought. These qualities showed themselves in a number of 
ways. For example, his originality was demonstrated relatively early. While 
a college undergraduate he developed a novel method for trisecting an angle 
and published a paper on it in the Scientific American. By the time he gradu- 
ated he had developed a motion-picture camera and projector of unusual 
design. It was this that brought him to the attention of Edison. His inde- 
pendence of thinking showed itself in the fact that he did not read widely in 
the psychological literature, as he was quite willing to admit, and in his general 
choice of some of the less popular subjects to which to devote his energies. 

His dissatisfactions were many. He was discontented with the state of 
affairs he found in connection with psychological testing, where a rapidly 
growing practice seemed to have little or no underlying theory. It was in 1924 
that he published his only attempt at general psychological theory, in his 
book entitled The Nature of Intelligence. More clearly, the need for test 
theory led him into the development of quantitative formulations, culminating 
in his multiple-factor analysis. His impatience with what he considered to be 
the severe limitations of classical psychophysics led to his development of 
basic theory of psychological measurement as effected through human judg- 
ments. He regarded his law of comparative judgment as one of his most 
important achievements. 

His performance as an analytical thinker was most clearly brought to the 
attention of the writer when, in the fall quarter of 1935, he was privileged to 
attend Thurstone’s seminar. Whenever a new problem came up in the seminar, 
it was a revelation to observe him go to work on it. He very quickly went to 
the heart of the problem, singling out the important variables in a way that 
made the problem appear simple. Incidentally, attending the seminar then 
and later were a large number of students, some at the post-doctoral level, 
who have achieved distinction in the field of psychometrics in their own right. 

Thurstone’s many contributions are so well known to readers of Psycho- 
metrika that they need not be enumerated here. Although he has been held in 
great respect by those outside the field of measurement, many of his develop- 
ments have not had the general impact that they should have had. Thurstone 
did not found a school, nor did he deal in popular or spectacular subjects. He 
often spoke to “deaf ears” because so few psychologists were prepared by 
virtue of interest or mathematical preparation to listen. Perhaps his contribu- 
tions that have gained the widest notice are his attitude-scaling methods and 
attitude scales, and his discovery of primary mental abilities and his tests of 
them. Of the many quantitative methods that he has provided, probably that 
of multiple-factor analysis stands first by a wide margin in potential usefulness 
and impact upon psychology in general. 

It is not so well known that in the later years of his life his energies were 
very much devoted to performance tests of personality. This interest sprang 
in part from dissatisfactions with both projective tests and personality in- 
ventories. He was also working on material for a book that was to give a 
general treatment of psychological measurement and on revisions of his tests 
of primary mental abilities. It is hoped that many of these last efforts had gone 
far enough to reach publication. 

Thurstone was always quite willing to acknowledge the assistance from 
his wife, Thelma. They collaborated on many studies and for years were 
jointly responsible for the American Council on Education college-aptitude 
examination. Their three sons are grown and started on their various pro- 
fessional careers. Besides the immediate family, Thurstone leaves behind 
a great many loyal and capable former students, who follow in the Thurstone- 
Chicago tradition, as well as many admirers around the world. 


University of Southern California J. P. Guilford 
PSYCHOMETRIC THEORY: GENERAL AND SPECIFIC* 
Ledyard R Tucker 


PRINCETON UNIVERSITY 
AND 
EDUCATIONAL TESTING SERVICE 


Fellow members of the Psychometric Society, and our colleagues, 
members of Division 5 of the American Psychological Association, I under- 
stand my mission tonight is to provide you with both humor and a mild 
message. Precedent has established an eminently high standard. Maybe I 
can explain the humbleness with which I approach this task by relating the 
account of an episode. This occurrence, like much illustrative data in Psycho- 
metrika, is fictitious, of course. 

During my trip from the splendorous East Coast to the beauteous West 
Coast, I took vacation time to tour part of the magnificent country in be- 
tween, as many of you, undoubtedly, did also. One afternoon I came upon a 
small tent village in a fairly remote mountain section. Not that such camps 
were so uncommon as to be remarkable in themselves; something about this 
particular camp caught my eye. In the center of this camp was a rustic 
conference table flanked on one side by a large blackboard mounted on cedar 
posts. This in itself was enough to raise a moderate interest, but my curiosity 
was intensely stimulated by another observation. Scribbled on this black- 
board were a number of familiar symbols. Here was a sigma, there was a 
derivative, another place was a box labelled “factor matrix.” How could 
this be? Needless to say I stopped to investigate. To my extreme pleasure 
I found myself among friends. A small group of Psychometricians had set 
up a summer seminar. There was nothing for me to do but to stay for dinner. 

Before and during dinner we chatted about this and that. There were 
a number of requests for news such as: Were the Brooklyn Dodgers still run- 
ning away with the race in the National League? Had Senator McCarthy 
started a new investigation? What were the leading plays in the summer 
theaters? After these matters had become settled, we sat back quietly for a 
period. This silence was ended by one member, whom, of course, I won’t name, 
exclaiming: “Say, I’ve got one that should work. How about this? 751955.” 
There was a short pause for ten seconds while the others seemed to stop 
and contemplate the proposition. Then there was loud and long laughter. 


*Presidential Address to the Psychometric Society, September 5, 1955. 
In the enthusiasm of the moment, another member called out “That would 
be like 683291.” Again there was laughter. Later in these proceedings I 
was able to break in and ask for information. I was told that the group had 
found the off hours to become a bit boring after the first week of their con- 
ference. They had heard each other’s jokes several times over and were 
tired of hiking and fishing. As an extra-curricular activity they had started 
to code the accumulation of humor. The numbers I had heard were such 
coded jokes. I decided to try this situation and so called out 10121492. 
There was not the least smile. Upon my inquiry of what was wrong, I wasn’t 
given the hackneyed answer that “some could tell them and some could not.” 
I had under-estimated our colleagues. They had not just gone about listing 
all jokes that they could think of and giving them a serial number in order 
of occurrence. They had gone about this activity like true Psychometricians. 
They had set up a system of analysis and classification of humor and had 
applied it to their sample of jokes. By use of systems of binary bits and 
addends of various other orders, they had established a system to encompass 
all possible jokes. Each sub-sub-etc. class, consisting of the jokes in one cell 
of this multiple classification system, had a unique identifying code which 
incorporated as meaningful units the position on each mode of classification. 
These identifications were then transformed to numbers in the decimal 
system. By considering the implications of any of the almost infinite number 
of empty cells they might conclude that the situation was humorous and have 
generated a new joke. This was what I had seen going on. It was a life-saver 
for the group, for otherwise they would not have been able to put up with 
each other for more than the second week. My weakness was that I had not 
realized the nature of their system and of its subtleties. I had pulled several 
boners. No wonder my selection was not humorous. Worst of all I had picked 
the lowest classification on the built-in criterion. I had told them that this 
joke was not the least bit funny. 

With the field of humor now having been thoroughly studied, we may 
turn our attentions to other areas. For a short period I will direct my attention 
to material related to the title of this address, Psychometric Theory: General 
and Specific. Here I am taking the privilege of expressing my opinions on a 
recommended course of action for Psychometrics. My major concern is to 
maximize the extent to which observations of psychological phenomena can 
be validly matched with expectations obtained from rational theories. 

While not wanting to dwell upon the philosophy of science—not that I 
would be able to do so if I wished—there are several aspects of my statement 
that warrant short discussion. By observations I mean the records obtained 
by definite procedures during the course of phenomena or of the observation 
of phenomena. The observations are subject to operational definitions. By 
expectations I mean statements describing the expected nature of observations.
Such statements might vary on a continuum between vagueness and 
definiteness. I wish to consider definite statements as constituting definite 
expectations. Any vagueness could be considered as the introduction of 
approximation in the expectation. Thus, we might have approximate expecta- 
tions. 

I might illustrate by reference to color matching. The theory of the 
color pyramid would indicate that a given combination of color disks in 
particular proportions on one color wheel would be matched by another 
group of color disks on another wheel in given proportions. When the colors 
of these disks and the various proportions are specified, a definite expectation 
is produced. A corresponding observation could be obtained in the laboratory. 
A subject would be shown the color wheel with the standard combination 
of disks. The second wheel could be supplied with the other set of disks and 
the subject be asked to adjust the proportions until the color produced 
matched the color of the first wheel. Our observation would contain the 
specifications of the situation and a record of the proportions established by 
the subject. 

I have avoided saying that observations were to be represented by 
rational theories. To make this latter statement would permit use of theories 
with infinite degrees of freedom so that these theories could be adjusted to 
represent any collection of observations, past, present and future. These 
theories would have no power, however, in providing definite statements of 
expectations concerning observations other than those observations the 
theories were adjusted to represent. In contrast, I have chosen to emphasize 
the correspondence between theoretically derived expectations and observa- 
tions. The power of a theory is directly related to the number of definite 
expectations it produces and inversely to the errors between these expectations 
and the observations. 

Let us consider the role of general psychometric theories in maximizing 
the extent to which derived expectations correspond to observations. Two 
aspects of this maximization appear: First, there may be a maximum cor- 
respondence, that is, minimum errors between the expectations and the 
observations. Second, the theory may yield a broad range of expectations. 
This is the place where general theories hold promise. We will all, undoubtedly, 
agree that it is desirable to have a systematic theory which will provide 
valid expectations for a large number of kinds of observations for many 
phenomena. There is a necessity, though, that we know how to use this 
theory to arrive at these expectations. Note my boner at the summer seminar 
previously described. As a further illustration, consider an advertisement 
that appeared a while ago in one of the weekly Princeton papers: 


Lost: A black female cocker spaniel. Will not answer to “Daisy,” 
although that is her name. 
Unless each theory explicitly indicates phenomena and observations to which 
it is applicable, we may be in no better position than would someone search- 
ing for Daisy. I fear that in a number of instances we have mistaken vagueness 
for generality. 

One may be interested in the relation between a method for analysis of 
data and a theory. In some senses a method of analysis might be considered 
as a general theory. A proper description of any such method includes a 
statement of the necessary conditions of situations in which the method 
might be employed. In a sense, however, we may contrast a method with a 
theory. This is in the specificity of definitions. If a construct might cor- 
respond to different kinds of observations in different situations, an ambiguity 
exists. A method that is applicable in several situations may be thought 
of as a theory in each situation. Each of these theories is separate from 
the other in that it includes the further definitions necessary to fit the method 
to the situation. Thus, a method may be considered as a family of theories. 
For example, multiple-factor analysis might be considered as a family of 
theories. Any time it is applied to a particular content area, it may be thought 
of as a theory. A method of analysis may be thought of as an abstraction 
from a number of situations with particular contents. In order to produce 
any particular theory from the family of theories represented by a method, 
it is necessary to specify particular contents. 

We may be interested in classifying methods of analysis as more or less 
general. This classification may be taken to correspond to the number of 
different situations in which the method is applicable. A more general method 
would be applicable in a larger number of different situations. However, 
there may be a trend that the more general the method is, in this sense, 
the more extensive the definitions need be in order to make the method 
explicitly applicable to any one particular situation. Thus, the more general 
methods may be further from theories. 

If a theory is to specify the observations and phenomena with which it 
deals, how then can a theory be general? We may wish to indicate the ex- 
tensiveness of a theory’s applicability by the contrasting terms: individual 
versus general. In case a theory concerns a large number of observables in 
a large variety of situations, the theory could be termed a relatively general 
theory. In case a theory concerns only a limited situation, the theory could 
be termed individual, or specific, to this situation. Note the particular usage 
of the word specific as synonymous to individual in this context. Both a 
general and an individual, or specific, theory should specify the observables 
with which it deals as well as situations or a situation in which the theory 
applies. 

Although one might hope that a collection of individual theories could 
be amalgamated into a more general theory, there is some possibility that 
the individual theories may be too divergent to provide a basis for this 
operation. Many individual theories arise from applications of psychometric 
methods to problems of applied psychology. The particular situations dealt 
with are defined by the needs of particular institutions. While this activity 
is a highly important aspect of psychometrics, its productivity of general 
theory is problematic. There is considerable danger that the practicing 
psychometrician may find himself in the kind of position illustrated by the 
following advertisement, which has also appeared in a weekly Princeton paper: 


Aspiring Artist: Will decorate children’s rooms. Funny animals a 
specialty, but will comply with any unreasonable request. 


It seems to me we should be on guard against at least the extremes of such 
situations. 

There seem to be problems, then, in our attempts at developing highly 
general theories. We may, in fact, produce instead of general theories families 
of individual theories which we might call psychometric methods. Or we 
may be so vague that the theory is unacceptable. Likewise, there are dangers 
in working with individual, or specific, theories. A recommended strategy 
is to attempt theories between these two extremes. The smaller, but not 
individual, theories may net us much in knowledge gained. These theories 
will be more easily established and experimented on by individuals. And 
finally we might hope that several such theories would be compatible for 
amalgamation with more general theories. 


Manuscript received 8/10/55 
F-TEST BIAS FOR EXPERIMENTAL DESIGNS OF THE 
LATIN SQUARE TYPE 


NEIL GOURLAY 
UNIVERSITY OF BIRMINGHAM 


In an earlier paper, a method of analysis, due to Neyman and now 
known generally as variance component analysis, was used to examine F-test 
bias for experimental designs in education of the randomized block type. 
The same method is now applied to study F-test bias for designs of the Latin 
square type. The results, in general, disprove the view that, for a valid appli- 
cation of Latin square techniques, it is necessary that all interactions are zero. 


In an earlier paper (5), a study was made of F-test bias for experimental 
designs in educational research of the randomized block type. In this paper, 
a similar study is made of those designs of which the simple Latin square 
is the prototype. The B-ratio technique, due to Neyman and described in 
the earlier paper, is again employed. 

As McNemar (8) points out, the usual textbook statement of the theory 
underlying the use of the Latin square implies zero interaction between 
the main effects. McNemar claims that where the assumption of zero interaction
is not met, investigators will obtain too many “significant F’s”; he 
then goes on to conclude that, since significant interactions are so common in 
psychological research, the Latin square is seldom appropriate and that 
“it is defensible only in those rare cases where one has sound a priori reasons 
for believing that the interactions are zero.” 

In the discussion which follows it will be shown that McNemar is by 
no means altogether correct in his point of view. It would appear that he has 
failed to realize that in the field of education and psychology, the application 
of the Latin square design has progressed beyond the usual simple textbook 
formulation and that some of the later applications show the need for 
modification of his rather sweeping generalization. In particular, it will be 
shown that the Latin square can be applied in several cases where the inter- 
actions are not zero; also, that in those cases where bias is present, it may well 
be negative and not positive as McNemar would maintain—a result which 
can only increase and not diminish the significance of any F-test. As defined in 
the earlier paper (5), an F-test is said to be positively or negatively biased, 
if, when the null hypothesis being tested is correct, it gives rise to a larger 
or smaller proportion, respectively, of significant F-ratios than is warranted 
by the F-distribution. 
It will be helpful to the treatment of our problem if we distinguish 
between the two main types of interaction that can occur in psychology: 


Type A—where each individual (or whatever the unit may 
be—class, grade, etc.) receives only one of the several treat- 
ments applied and is represented by only one measurement in 
the data to be analyzed. In this case, interaction is between 
main effects, such as treatments, schools, etc. 

Type B—where repeated measurements are made on the 
same individual or group. In this case conditions may differ 
or different treatments may intervene; earlier measurements 
(treatments, etc.) may affect, i.e., interact with, those that 
follow. 


The position is still further complicated in that Type B interaction 
may be accompanied by Type A. Also, besides pure interaction effects, the 
interaction component of any analysis of variance may contain other terms— 
described as group errors in the previous paper (5). 

1. Applications of the Latin Square Involving Type A Interaction Only 


1.1. An experiment comparing several methods of teaching some school topic 


For simplicity let us take the case of a 3 × 3 square, say: 


                    Streams
                  u    v    w
              f   A    B    C
    Schools   g   B    C    A
              h   C    A    B


i.e., in each school there are three experimental groups which are subjected 
to the three methods A, B, and C and which can be classified according to 
some other factor (e.g., streams). (For the benefit of some readers it is to 
be explained that in many English schools the children in each grade are 
assigned to classes according to level of ability. The process is known as 
streaming. A three-stream school is one in which there are three classes in 
each grade representing three levels of ability.) It will be assumed that the 
numbers in each group are equal—to avoid bias as discussed in the earlier 
paper (5). 
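
The layout just described can be sketched in a few lines of Python. This is only an illustration added for the reader, not part of the original analysis; the helper names `cyclic_latin_square` and `is_latin_square` are hypothetical and introduced here. The sketch builds the cyclic 3 × 3 square shown above and checks the defining property that each method occurs once in every school (row) and once in every stream (column).

```python
# Illustrative sketch only: helper names are hypothetical, not from the paper.
SCHOOLS = ["f", "g", "h"]   # rows of the square
STREAMS = ["u", "v", "w"]   # columns of the square
METHODS = ["A", "B", "C"]   # treatments placed in the cells


def cyclic_latin_square(symbols):
    """Row i is the list of symbols shifted left by i places."""
    n = len(symbols)
    return [[symbols[(i + j) % n] for j in range(n)] for i in range(n)]


def is_latin_square(square):
    """True if every symbol occurs exactly once in each row and each column."""
    n = len(square)
    symbols = set(square[0])
    if len(symbols) != n:
        return False
    rows_ok = all(len(row) == n and set(row) == symbols for row in square)
    cols_ok = all({square[i][j] for i in range(n)} == symbols for j in range(n))
    return rows_ok and cols_ok


square = cyclic_latin_square(METHODS)
for school, row in zip(SCHOOLS, square):
    print(school, " ".join(row))
print("Latin square:", is_latin_square(square))
```

Each of the nine cells then corresponds to one experimental group, identified by its school, its stream, and the method it receives.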

Then we may consider two hypotheses: either (i) a particular hypothesis:
the methods have the same mean effect when an average is taken for each 
method over the three schools and the three streams; or (ii) a general hypothesis:
the methods have the same mean effect when an average is taken 
over the total population of schools (of which the three given schools are a 
random sample) and the three streams. 




The first hypothesis is of little interest to the practical investigator. 
Furthermore, when real interaction is present between methods, schools 
and streams, the hypothesis cannot validly be tested by the Latin square 
design and analysis. There are obviously insufficient data: a factorial design 
is required. But it does not follow, as McNemar maintains, that the Latin 
square analysis would be positively biased. It might be positively biased in 
certain cases, but in others, it would be negatively biased, e.g., when real 
interaction exists only between two of the three classifications (the Latin 
square design then becomes effectively factorial). 

Now let us consider the bias involved in the Latin square analysis when 
used to test the general hypothesis. It is now necessary to think of the three 
schools as a random sample from the total population of schools (which, as 
usual, will be taken as being infinite in number). Also the three schools 
should be assigned at random to the rows of the Latin square. 

Then the mean scores for the nine experimental groups might be repre- 
sented as follows: 


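\[
\begin{array}{lll}
x_f + Q_u + T_A + \eta_{uA} + e_1, & x_f + Q_v + T_B + \eta_{vB} + e_2, & x_f + Q_w + T_C + \eta_{wC} + e_3, \\
x_g + Q_u + T_B + \eta_{uB} + e_4, & x_g + Q_v + T_C + \eta_{vC} + e_5, & x_g + Q_w + T_A + \eta_{wA} + e_6, \\
x_h + Q_u + T_C + \eta_{uC} + e_7, & x_h + Q_v + T_A + \eta_{vA} + e_8, & x_h + Q_w + T_B + \eta_{wB} + e_9,
\end{array}
\tag{1}
\]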


where 


(i) The general mean over the three methods, the three streams, and 
the total population of schools has been taken as zero. 

(ii) The $x$-values are the main effects of schools, the $x$-value for each 
school being the mean for the school taken over the nine method-stream 
combinations. (It will be assumed that the total population of $x$-values has 
zero mean and variance $\sigma_x^2$.) 

(iii) The $Q$’s are the main effects of streams over the three methods 
and the total population of schools, and $\sum_3 Q = 0$, where $\sum_n F$ is defined 
to be the sum of all the terms or expressions of type $F$, and $n$ denotes the 
total number of such terms. 

(iv) The $T$’s are the main effects of the methods over the three streams 
and the total population of schools, and $\sum_3 T = 0$. 

(v) The nine $\eta$’s represent the real interaction effects of the three methods 
and the three streams (over the total population of schools) and are such that 

\[
\begin{aligned}
\eta_{uk} + \eta_{vk} + \eta_{wk} &= 0 \quad (k = A, B, C), \\
\eta_{lA} + \eta_{lB} + \eta_{lC} &= 0 \quad (l = u, v, w).
\end{aligned}
\tag{2}
\]
(vi) The $e$’s include real interaction between schools, on the one hand, 
and method-stream combinations, on the other, and group and sampling 
error. Each $e$ could in fact be expressed as 

\[ e = \zeta + \xi, \]

where $\zeta$ is the real interaction term for the school and method-stream
combination to which the given $e$ belongs, and $\xi$ is a purely random term made 
up of group error and sampling error. 

The infinite population of $e$’s (of $\zeta$’s and $\xi$’s) corresponding to any one 
cell of the Latin square will have zero mean. We shall assume that the nine 
populations of $e$-values (one for each cell) have the same variance $\sigma_e^2$ (the 
usual assumption of homogeneity of variance). 

The $e$’s of the cells in any one row of the square will be correlated since 
their $\zeta$-components are correlated. A few words of explanation might be 
helpful. For each school there are nine $\zeta$-values (one for each method-stream 
combination) and their sum is zero; this follows from definition (ii) above. 
Also, for each method-stream combination there will be an infinite population 
of $\zeta$-values (one $\zeta$ for each school). Furthermore, these nine populations of 
$\zeta$-values will be correlated because the sum of the nine $\zeta$-values for each 
school is zero. Since schools are assigned at random to the rows of the Latin 
square, it is not very difficult to see that, while the $\zeta$-values which appear in 
the three cells of any one row will be correlated with one another, they will 
not be correlated with the $\zeta$-values in the cells of the other two rows. 

The correlations (nine in all) between the $e$-values may not all be the 
same. This will happen when heterogeneity of correlation exists between 
the $\zeta$-interaction effects. [A simple example of this type of heterogeneity is 
discussed more fully in the first paper (5).] 

Let the correlations in the first row be denoted by $\rho_{12}$, $\rho_{23}$, $\rho_{13}$, where 
$\rho_{12}$ represents the correlation between the $e$’s in the first and second cells 
of the row, and so on. Let the other six correlations be $\rho_{45}$, $\rho_{56}$, $\rho_{46}$ and $\rho_{78}$, 
$\rho_{89}$, $\rho_{79}$. Also let $R = \sum_9 \rho$. 

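Now let us derive the E. V.’s (expected values) of the different sums of 
squares of the variance analysis over the total population of schools. The 
total sum of squares is 

\[ \sum_9 (M - \bar{M})^2, \]

where $M$ is any one of the nine group means in (1) and $\bar{M}$ is the mean of all nine. 
By equations above, 

\[ \sum_9 (Q + T + \eta)^2 = 3 \sum_3 Q^2 + 3 \sum_3 T^2 + \sum_9 \eta^2. \tag{4} \]

It follows that the E. V. of total sum of squares is 

\[ 6\sigma_x^2 + 3 \sum_3 Q^2 + 3 \sum_3 T^2 + \sum_9 \eta^2 + 8\sigma_e^2 - \frac{2R}{9}\sigma_e^2. \tag{5} \]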
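The sum of squares between methods is 

\[ \sum_3 (M_A - M_B)^2, \tag{6} \]

where 

\[ M_A - M_B = (T_A - T_B) + \tfrac{1}{3}\left[(e_1 + e_6 + e_8) - (e_2 + e_4 + e_9)\right], \tag{7} \]

and 

\[ \text{E. V. } (M_A - M_B)^2 = (T_A - T_B)^2 + \tfrac{1}{9}\left[6\sigma_e^2 - 2\sigma_e^2(\rho_{12} + \rho_{46} + \rho_{89})\right]. \tag{8} \]

Therefore, the E. V. of sum of squares between methods is 

\[ \sum_3 (T_A - T_B)^2 + 2\sigma_e^2 - \frac{2R}{9}\sigma_e^2 = 3 \sum_3 T^2 + 2\sigma_e^2 - \frac{2R}{9}\sigma_e^2 \quad \Bigl(\text{since } \sum_3 T = 0\Bigr). \tag{9} \]

Similarly, the sum of squares between streams has E. V. 

\[ 3 \sum_3 Q^2 + 2\sigma_e^2 - \frac{2R}{9}\sigma_e^2. \tag{10} \]

The sum of squares between schools is 

\[ \sum_3 (M_f - M_g)^2, \tag{11} \]

where 

\[
\begin{aligned}
M_f - M_g = (x_f - x_g) &+ \tfrac{1}{3}\left[(\eta_{uA} + \eta_{vB} + \eta_{wC}) - (\eta_{uB} + \eta_{vC} + \eta_{wA})\right] \\
&+ \tfrac{1}{3}\left[(e_1 + e_2 + e_3) - (e_4 + e_5 + e_6)\right],
\end{aligned}
\tag{12}
\]

and 

\[
\begin{aligned}
\text{E. V. } (M_f - M_g)^2 = 2\sigma_x^2 &+ \tfrac{1}{9}\left[(\eta_{uA} + \eta_{vB} + \eta_{wC}) - (\eta_{uB} + \eta_{vC} + \eta_{wA})\right]^2 \\
&+ \tfrac{2}{3}\sigma_e^2 + \tfrac{2}{9}\sigma_e^2(\rho_{12} + \rho_{23} + \rho_{13} + \rho_{45} + \rho_{56} + \rho_{46}).
\end{aligned}
\tag{13}
\]

It follows, on reduction, that the E. V. of the sum of squares between schools is 

\[ 6\sigma_x^2 + \tfrac{1}{3} \sum_3 (\eta_{uA} + \eta_{vB} + \eta_{wC})^2 + 2\sigma_e^2 + \frac{4R}{9}\sigma_e^2. \tag{14} \]

The E. V. of the residual sum of squares can now be found by subtraction, 
but it can also be found directly after the manner of the other variance 
components. (The second procedure provides a check on the algebra in that 
the E. V. for total should equal the sum of the other E. V.’s). By either method 
the E. V. for residual is 

\[ \tfrac{1}{3} \sum_3 (\eta_{uA} + \eta_{vC} + \eta_{wB})^2 + 2\sigma_e^2 - \frac{2R}{9}\sigma_e^2. \tag{15} \]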
It is to be noted that 

\[ \sum_9 \eta^2 = \tfrac{1}{3} \sum_3 (\eta_{uA} + \eta_{vB} + \eta_{wC})^2 + \tfrac{1}{3} \sum_3 (\eta_{uA} + \eta_{vC} + \eta_{wB})^2. \]

This is easily deduced from (2). 
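
A numerical check may help the reader here. The sketch below is an illustration added to the text, under the assumption that the identity in question is the decomposition of the sum of squared method-stream interactions into the two sets of "diagonal" sums of the 3 × 3 interaction table, one third of the squares of each set. The function names `double_center` and `diagonal_sums`, and the numbers in `raw`, are hypothetical. An arbitrary table is first adjusted so that its rows and columns sum to zero, as the constraints (2) require.

```python
# Illustrative sketch only: names and numbers are hypothetical.

def double_center(m):
    """Subtract row and column means and add back the grand mean,
    so that every row and every column of the result sums to zero."""
    n = len(m)
    row_means = [sum(row) / n for row in m]
    col_means = [sum(m[i][j] for i in range(n)) / n for j in range(n)]
    grand = sum(row_means) / n
    return [[m[i][j] - row_means[i] - col_means[j] + grand
             for j in range(n)] for i in range(n)]


def diagonal_sums(eta, sign):
    """Sums of eta over the three 'diagonals' of a 3 x 3 table.

    eta[l][k] is the interaction of stream l (u, v, w) with method k (A, B, C).
    sign=+1 gives the groups (uA, vB, wC), (uB, vC, wA), (uC, vA, wB).
    sign=-1 gives the groups (uA, vC, wB), (uB, vA, wC), (uC, vB, wA).
    """
    return [sum(eta[l][(s + sign * l) % 3] for l in range(3)) for s in range(3)]


raw = [[4.0, 1.0, -2.0], [0.5, 3.0, 2.5], [-1.0, 2.0, 6.0]]  # arbitrary values
eta = double_center(raw)

total = sum(v * v for row in eta for v in row)
first = sum(a * a for a in diagonal_sums(eta, +1)) / 3
second = sum(a * a for a in diagonal_sums(eta, -1)) / 3
print("identity holds:", abs(total - (first + second)) < 1e-9)
```

The two sets of diagonals correspond to the two essentially different arrangements of the 3 × 3 square, which is why one set enters the schools sum of squares and the other the residual.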

We are now in a position to apply the B-ratio technique to examine the 
bias involved in the F-test methods v. residual. The B-ratio for this test is 
obtained by (i) applying the null hypothesis that the main effects for methods 
are equal, i.e., $T_A = T_B = T_C$ (= 0 since $\sum T = 0$) and (ii) taking the 
ratio of the E. V. of the methods variance to the E. V. of the residual variance. 
It will be seen that it has the value 


which will normally be less than unity if there is any real interaction between 
methods and streams. For the 3 × 3 Latin square, there are essentially two 
random arrangements: 


A B C          A C B 
B C A   and    B A C 
C A B          C B A 


For the second arrangement, the B-ratio is of the form 


(1 /| (1 + (mua + tee + |. 


It obviously leads to the same conclusions that apply to the first arrangement. 

This suggests that the bias of the F-test will be negative and not positive 
as McNemar claimed. But another factor to be allowed for is the heterogeneity 
of correlation between the $\varepsilon$-values. As was shown in the previous 
paper, this heterogeneity produces positive bias. Whether the combined 
effect of the two types of bias is positive or negative will depend on various 
factors. But the point to be emphasised is that, for this application of the 
Latin square, the bias, if not negative, is at least less than that for the fac- 
torial type of experiment. If the bias is unimportant in the one case, it must 
a fortiori be unimportant in the other. 

Unequal numbers of cases in the cells of the Latin square (whether 
proportionate or disproportionate with regard to the three classifications, 
schools, etc.) will also introduce bias into the F-test. This type of bias is 
discussed in the earlier paper for the case of the replicated (factorial) type 
of experiment. 


1.2. Alternative designs 


A single Latin square is not a practical design for a methods experiment. 
It provides too few degrees of freedom for the estimation of the residual or 
error variance. To obtain increased precision, two courses are available: 
either (1.2a) the replication of the same type of square; or (1.2b) the use of 
two or more different types of square with or without replication. In both 
cases, each square and each replication of it requires a separate sample of 
schools. 

There would appear to be no special merit in preferring course (1.2a) 
to (1.2b) as some investigators have done—under the belief, presumably, 
that the use of as many different squares as possible is a necessary part of 
the randomization process. It is quite sufficient for randomization that the 
schools are allocated at random to the rows of a square [whether to the same 
square throughout as in (1.2a) or to different squares as in (1.2b)]. 


1.2a. The replication of the same square 


For simplicity, we will again take the 3 × 3 square discussed in section 
1.1. Let there be n replications, involving, therefore, 3n schools in all. Then 
by an analysis very similar to that of section 1.1, it can be shown that the 
expected values for the different components of the variance analysis are as 
in Table 1 (same notation as before). 

Two observations can be made: (i) By testing methods against residual 
within schools instead of square residual, a much more precise test is obtained. 
Furthermore, this F-test cannot be biased (negatively) as a result of any 
real interaction between methods and streams. It will, however, like the other 
test, be subject to any positive bias arising from heterogeneity of correlation 
between the cells in any one row. (ii) A test of interaction between methods 
and streams is provided by the F-tests: square residual v. residual within 
schools and rows v. residual between schools. 

If interaction were shown to be present, and it were desirable that the 
methods should be compared for the three streams separately (instead of an 
average being taken as for the null hypothesis tested above), this could 
easily be achieved by analyzing the results for each stream separately. 
The precision of the tests involved would, however, be poor since school 
differences would be contained in the error variances. 


1.2b. Use of two or more types of square with or without replication 


An analysis similar in type to (1.2a) can again be carried out. The precise 
form the analysis takes will, of course, vary with the types of squares selected 
and the numbers of replications. To save space no such analysis is reproduced 
here. It might, however, be pointed out that, where 3 × 3 squares are involved, 
there are only two possible types of square and, with the same number 
of replications of each, the analysis bears a certain similarity to that discussed 
on p. 283 et seq. (See also Table 2.) The conclusions to be drawn with 
regard to bias are the same as for case (1.2a). 


1.3. More complicated applications 


Consider the designs given in Figure 1. 


FIGURE 1 
Design 1a and Design 1b (diagrams not reproduced; surviving labels: Ages, Boys, Girls, Schools, 2 Methods; a, b, c, d; A, B, C, D) 


Design 1a, or something very similar, was used by Burt and Lewis (2). 
Once again it can be shown that the B-ratio for design 1a is less than unity, 
although as in all the other cases discussed, positive bias can result from 
heterogeneity of correlation between the four cells in each of the eight rows. 

But the same cannot be said for design 1b. In this case it can be shown 
that with real interaction present, the B-ratio is likely to be greater than unity: 
a more serious degree of positive bias may, therefore, be present in the F-test. 


2. Applications of the Latin Square Involving Type B Interaction 


Most of the Latin square studies reported in journals have been of the 
type where the subjects of the experiments have been subjected to a suc- 
cession of treatments and tested for each treatment: they could therefore 
involve Type B interaction [cf. Thomson (11), Sutherland (10), Grant (6), 
Edwards (4), and Archer (1)]. 

The Latin square design requires the order of succession of treatments 
to vary for the individuals and thus makes the problem of interaction more 
complicated than that which deals with Type A interaction only. 


2.1. The case of a single Latin square 


For example, consider three individuals subjected successively to three 
treatments A, B, and C in the following orders: 


               1   A B C 
Individuals    2   B C A 
               3   C A B 


With interaction present (whether type A or B), as stated earlier, the square 
does not provide sufficient data to test any worth-while hypothesis for the 
three treatments and the three individuals. 

Can a general hypothesis be tested? Is there any difference between 
treatments when averaged over the total population of individuals of which 
our three cases are a sample? Once again the data are insufficient if type B 
interaction is present. The square involves only three of the possible six 
orders for the treatments, and all six orders would have to be considered in 
order to obtain a worth-while generalized result. We will examine the latter 
possibility presently. The only conclusion to be drawn is that a single Latin 
square is of little use when type B interaction is present or suspected. This, 
of course, is in accord with McNemar’s point of view. 


2.2. Replication of the same square 


In this case each of the treatment sequences is applied not just to one 
individual but to a group of individuals. The applications of both Thomson 
(11) and Sutherland (10) fall into this category. (Sutherland’s square is a 
Greco-Latin square but the same principles apply.) The method was also 
used by Corrigan and Brogden (3), whose application was discussed by 
Grant (6). Edwards (4) also illustrates the method. 

In outward form, the analysis is very similar to that already discussed 
on p. 279. Schools is replaced by individuals and rows now represents groups 
of individuals undergoing different treatment sequences. Streams is replaced 
by some other classification. But there are some important differences which, 
to save space, we will only indicate: 

(i) Besides possible type A interaction terms, there may be also type B 
interaction terms affecting each of the first four variance components (see 
Table 1). 

(ii) In the earlier analysis, group errors (i.e., errors, other than random 
error, peculiar to a school group) were included in $\sigma_\varepsilon^2$ and, therefore, affected 
all variance components. But in the present case, it will be seen that group 
errors will vary from cell to cell of the Latin square but will be the same for 
all the individual measurements in a cell, i.e., group error variance will form 
part of the first four variance components of the analysis but not of the two 
residual variances. 


What tests may be applied and what is the position with regard to bias? 
(i) An important test is that of square residual (or uniqueness) against residual 
within individuals. If significant, this may indicate either type A or B inter- 
action, or group errors, or some combination of the three; the analysis cannot 
differentiate. With a significant result, there is little point in proceeding 
further. The test treatments v. residual within individuals would have such a 
limited interpretation that it would be virtually valueless; as a test of a general 
hypothesis about treatments, the test treatments v. square residual would be 
biased to an unknown extent. 

(ii) If non-significance is obtained for the test of square uniqueness, a 
further test of zero interaction (and group error) is provided by taking 
rows (groups or sequences) against residual between individuals, provided the 
groups were random in the first place. 

(iii) With both these tests non-significant, the other main effects might 
be tested against residual within individuals. But the reader must be warned 
against following such a test sequence blindly. It is to be remembered that a 
statistical test cannot prove the null hypothesis on which it is based; although 
the two preliminary tests are non-significant, it may still be the case that 
interaction (and/or group error) is present. A priori knowledge as to the 
likelihood of interaction and group error is obviously important. Where 
past experience would suggest that no interaction or group error is likely to 
be present, and the two preliminary tests confirm this, the tests of main 
effects against residual within individuals can be made with some safety. 
But where interaction or group error is known to be likely, little reliance 
can be placed on the tests of main effects, even though the preliminary tests 
give a non-significant result. In other words, the given experimental design is 
almost useless for dealing with this situation. 

Corrigan and Brogden’s data (3) show non-significance for both pre- 
liminary tests. Sutherland’s data (10) show significance for the second test 
(the groups were not random) and would appear to show significance also for 
the first test; this would, of course, invalidate his other tests. 

Edwards (4, p. 325) seems to regard square residual and residual within 
individuals as estimates of the same variance but, as we have seen, this can 
only be the case when interaction and group errors are zero. 


2.3. Analysis involving complete sets of squares 


When type B interaction is present or suspected, it is obvious that all 
possible treatment sequences must be considered if a generalized result is 
to be obtained. Grant (6) discusses this case. 

Once again, for simplicity, we will consider the case of the 3 × 3 square. 
We will assume that individuals are assigned at random to the rows. There 
are then effectively only two Latin squares involved, corresponding to the 
six possible orders of treatment ABC, BCA, CAB and ACB, BAC, CBA. 
For convenience we will give the eighteen cells of the design the numbers 
1 through 18, in the order just stated, for the six sequences. 
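
As an illustrative aside, the grouping of the six treatment orders into two cyclic Latin squares, and the numbering of the eighteen cells in the order just stated, can be sketched in Python. The helper names `cyclic_square` and `is_latin` and the dictionary `cell_number` are hypothetical and are introduced only for this sketch; they are not part of the original analysis.

```python
from itertools import permutations

# All six possible orders of the three treatments.
orders = ["".join(p) for p in permutations("ABC")]

def cyclic_square(first_row):
    """Build a square whose rows are the cyclic shifts of first_row."""
    return [first_row[s:] + first_row[:s] for s in range(len(first_row))]

def is_latin(square):
    """True if every symbol occurs exactly once in each row and each column."""
    symbols = set(square[0])
    rows_ok = all(set(row) == symbols and len(row) == len(symbols) for row in square)
    cols_ok = all({row[c] for row in square} == symbols for c in range(len(square[0])))
    return rows_ok and cols_ok

square_1 = cyclic_square("ABC")  # rows ABC, BCA, CAB
square_2 = cyclic_square("ACB")  # rows ACB, CBA, BAC

# Cells numbered 1 through 18 in the order of the six sequences given in the text.
sequences = ["ABC", "BCA", "CAB", "ACB", "BAC", "CBA"]
cell_number = {(seq, pos): 3 * r + pos
               for r, seq in enumerate(sequences)
               for pos in (1, 2, 3)}
```

Under this numbering, the entry for a given sequence and column position is looked up as, for example, `cell_number[("BCA", 2)]`.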

We will consider the case of n replications of the design, i.e., n individuals 
will be assigned to each row (or sequence). Then each of the n entries in any 
one of the eighteen cells may be represented as the sum of six terms of the 
form 

$\pi + Q + T + \eta + \xi + \varepsilon$, (17) 


where 

(i) The general mean over the eighteen cells and the total population of 
individuals (assumed infinite) has been taken as zero. 

(ii) The $\pi$-values (6n in all) are the main effects of individuals averaged 
over the six sequences. (It will be assumed that the total population of 
$\pi$-values has zero mean and variance $\sigma_\pi^2$.) 

(iii) The Q's ($Q_a$, $Q_b$, $Q_c$) are the main effects of columns, averaged 
over the six sequences and the total population of individuals, and $\sum Q = 0$. 

(iv) The T's ($T_A$, $T_B$, $T_C$) are the main effects of treatments averaged 
over the six sequences and the total population of individuals, and $\sum T = 0$. 

(v) The $\eta$'s (18 in all) represent the joint effect of type A and B interaction 
for the 18 cells and are such that the six sums of six $\eta$-terms corresponding 
to the three treatments and the three columns are each zero. 

(vi) The $\xi$'s (18 in all) represent possible group or cell errors. (It will be 
assumed that they are random and that the total population of values has 
zero mean and variance $\sigma_\xi^2$.) 

(vii) The $\varepsilon$'s (18 in all) represent the residuals within cells after all the 
other effects have been taken out. It will be assumed that the total population 
of $\varepsilon$'s for each cell has zero mean and variance $\sigma_\varepsilon^2$. Then the $\varepsilon$'s of the cells in 
any one row will be correlated. Let $\rho_{12}$, $\rho_{23}$, $\rho_{13}$ represent the correlations for 
the first row and so on. Also let $R = \sum \rho$. 

The derivation of the E. V.’s of the different sums of squares in the vari- 
ance analysis is not reproduced here but the results are given in Table 2. 

What conclusions are to be drawn from this analysis? 

(i) If one or both of the F-tests residual between cells v. residual within 
individuals and rows (or sequences) v. residual between individuals is significant, 
non-zero interaction and/or group errors is indicated; the analysis cannot 
differentiate. It would then be invalid to test treatments (or columns) against 
residual within individuals unless there was other evidence (possibly arising 
from the design of the experiment) to show that group errors were not present. 

If, on the other hand, the first two tests were non-significant, the test 
of the significance of treatments (or columns) might safely be made. (The 
same warning as appears on p. 283 applies here). 

(ii) It is always possible to test treatments (or columns) against residual 
between cells. The B-ratio for this test is never greater than unity. In this 
respect the present analysis differs from that for the replication of the same 
square (see previous section), where type B interaction may affect both 
treatments (or columns) and residual between cells to give a B-ratio (and 
therefore a bias) of unknown size. 

Before concluding this section it might be of interest to mention that 
type B interaction is similar to the carry-over effect studied by statisticians 
in animal science [cf. Patterson (9) and Lucas (7)]. Further, the experimental 
model which they consider is very much the same as that treated in this 
section. Their analysis of variance, however, follows quite a different pattern 
and permits the testing of a wider range of hypotheses. One of their findings 
is that there is no bias involved in the testing of unadjusted direct effects 
against the error variance. This agrees with conclusion (i) above. (It must 
be noted that group error does not occur in the animal science experiment.) 


2.4. More complex designs. 


No attempt will be made to consider F-test bias for analyses of more 
complex designs. It should now be apparent that where tests rest on the 
assumption of zero interaction and group error, the design should provide a 
test of this assumption. Also, in cases where such a test proves significant 
(or where the presence of interaction or group error is known to be likely 
even though unrevealed by any test), the design should furnish tests of 
main effects, which, although less precise than those which might otherwise 
have been used, possess B-ratios not exceeding unity. 

Archer (1) shows himself to be aware of the limitations of the designs 
he offers in his paper, but considers that the difficulty could be partially 
overcome by ensuring that the interactions, for which his methods provide 
no test, are those which the investigator has decided a priori to be unimportant. 
There is a danger that these a priori decisions may be purely ad hoc assumptions 
and bear little relation to actual fact. 
Analogs of Pearson’s coefficient of racial likeness and of Mahalanobis’ 
distance measure have been proposed as descriptive statistics for comparing 
two individuals. This paper shows that two different definitions of “uncorrelated” 
variables, one associated with an inverse transformation and 
the other with a principal-axis transformation, give rise to these two descriptive 
statistics. The effects of putting the data into certain forms, such as 
equalizing the variances of the variables or equalizing the means of the 
persons, prior to using either of the two transformations, are discussed. 


The interest in measures for assessing similarity (or dissimilarity) of 
profiles is reflected in such recent summaries and discussions as those of 
Osgood and Suci (5), Gaier and Lee (3), Webster (8), Cronbach and Gleser 
(2), and Thorndike (7). Several of these papers consider the problem of 
similarity of profile for two individuals, as contrasted with two groups. 
The latter problem may be formulated as one of discrimination between 
groups, and, as Cronbach and Gleser point out, two well-known approaches 
to its solution have been tried. One is the Pearson coefficient of racial 
likeness and the other the Mahalanobis distance measure, which is known to 
be related to Fisher’s discriminant function. The analogs of these two measures 
for the problem of comparing two individuals have been suggested in the 
discussions mentioned above. For one, Osgood and Suci and, independently, 
Cronbach and Gleser have suggested a measure that is analogous to the 
Pearson CRL; also, Cronbach and Gleser suggest a Mahalanobis-type 
measure and compare and contrast it with the former. 

The purpose of this paper is to examine these two proposed measures 
of profile similarity as descriptive statistics. In order to do this, the concept 
of Euclidean distance will be reviewed, a matric notation developed to 
describe a distance measure, and then the distinction between these two 
measures considered as a distinction between two definitions of uncorrelated 
variables. Finally, the effects of adopting certain forms of the data upon 
these measures will be outlined. The treatment here follows from well- 
known principles of matrix algebra, and consequently does not offer any 
strictly new propositions. However, it does clarify certain characteristics 
of these two proposed measures and in so doing may assist in describing them 
as descriptive statistics. 


Euclidean Distance 


The Euclidean distance between two points in space is a well-defined 
concept that has been generalized to a space of any size. Providing that 
the space, of size k, say, has been defined by a rectangular Cartesian system 
of reference axes, then the square of the distance between any two points 
in this space is given by the sum of the squares of the differences between 
paired coordinates of the two points. A rectangular Cartesian system consists 
of k mutually perpendicular (orthogonal) axes; the pairing of coordinates is 
done, of course, with respect to these k reference axes. For example, suppose 
that four persons are located in a space of size two by the following coordi- 
nates with respect to a rectangular Cartesian system: 


Person a Person b Person c Person d 
Axis 1 4 0 1 3 
Axis 2 9 5 8 6 


The square of the distance between persons a and b, say, is given by: 
$(4 - 0)^2 + (9 - 5)^2 = 32$. Since squares are being summed, the result is 
obviously the same if we compute, instead, $(0 - 4)^2 + (5 - 9)^2 = 32$. 

Designate this matrix as X. A method of securing these Euclidean 
distances is to operate on the matrix X’X, where X’ is the conventional 
transpose of X. For these data, X’X is 


a b c d 
a 97 45 76 66 
b 45 25 40 30 
c 76 40 65 51 
d 66 30 51 45 


The diagonal elements of X’X are simply the sums of the squares of the coor- 
dinates for a given person. The off-diagonal elements are the sums of the 
paired products of the coordinates for the two persons designated by the row 
and column headings. Thus, for person a the diagonal element is 
$(4)^2 + (9)^2 = 97$. The element 45, occurring in row b and column a, and in 
row a and column b as well, is given by (4)(0) + (9)(5) = 45. The square 
of the distance between persons a and b is then given by 97 + 25 — 2(45) = 32, 
as before. This in effect merely uses the principle of rewriting a square of a 
difference between two terms as the sum of the squares of the terms minus 
twice their cross-product. 
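
The arithmetic above can be checked with a minimal Python sketch. The helper names `gram` and `squared_distance` are hypothetical and are introduced only for illustration; the data are the coordinates of persons a through d given earlier.

```python
# Rows are the two reference axes; columns are persons a, b, c, d.
X = [[4, 0, 1, 3],
     [9, 5, 8, 6]]

def gram(X):
    """Return X'X: the scalar products of every pair of person (column) vectors."""
    n = len(X[0])
    return [[sum(row[i] * row[j] for row in X) for j in range(n)]
            for i in range(n)]

def squared_distance(G, i, j):
    """Squared distance from the entries of X'X: two squares minus twice the cross-product."""
    return G[i][i] + G[j][j] - 2 * G[i][j]

G = gram(X)
d2_ab = squared_distance(G, 0, 1)  # persons a and b
```

The matrix `G` reproduces the table of X'X shown above, and `d2_ab` agrees with the value obtained directly from the coordinates.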

The diagonal elements of the matrix X’X give the squares of the lengths 
of each person vector in the k-space, and the off-diagonal elements give the 
scalar products of each pair of person vectors in this k-space. A measure of 
Euclidean distance between persons is thus given by the indicated operation 
on the matrix X’X, when the matrix X describes the several persons with 
respect to a rectangular Cartesian system. This operation may be formulated 
in matric terms. For any pair of persons, i and j, this operation consists of 
pre-multiplication by a row vector, E, of this form: 


Persons:   1  ...   i  ...   j  ...   N 
E = (      0  ...  +1  ...  −1  ...   0 ) 

followed by post-multiplication by the transpose of this vector. For example, 
the square of the distance between persons a and b is given by 


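$$
\begin{bmatrix} +1 & -1 & 0 & 0 \end{bmatrix}
\begin{bmatrix} 97 & 45 & 76 & 66 \\ 45 & 25 & 40 & 30 \\ 76 & 40 & 65 & 51 \\ 66 & 30 & 51 & 45 \end{bmatrix}
\begin{bmatrix} +1 \\ -1 \\ 0 \\ 0 \end{bmatrix}
$$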


which is equal to (97 — 45) — (45 — 25) = 32, as before. Thus, the square of 
the distance between any pair of persons is given by a product of matrices 
that may be written 


$EX'XE' = D^2.$ 


This $D^2$ may be interpreted in more than one way, depending upon how one 
defines uncorrelated variables. This problem must now be considered. 
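
As a sketch of the matric formulation, the quadratic form in the row vector E can be evaluated for every pair of persons in the example. The function names `selector` and `quadratic_form` and the variable `D2` are hypothetical; the matrix is the X'X given above.

```python
# X'X for persons a, b, c, d, as tabulated in the text.
XtX = [[97, 45, 76, 66],
       [45, 25, 40, 30],
       [76, 40, 65, 51],
       [66, 30, 51, 45]]

def selector(i, j, n):
    """Row vector E with +1 for person i, -1 for person j, zeros elsewhere (i != j)."""
    e = [0] * n
    e[i], e[j] = 1, -1
    return e

def quadratic_form(e, M):
    """Evaluate e M e' for a row vector e and a square matrix M."""
    n = len(e)
    return sum(e[r] * M[r][c] * e[c] for r in range(n) for c in range(n))

n = len(XtX)
# Squared distances for all pairs; the diagonal is zero by definition.
D2 = [[0 if i == j else quadratic_form(selector(i, j, n), XtX) for j in range(n)]
      for i in range(n)]
```

Each off-diagonal entry of `D2` is one application of the pre- and post-multiplication by E described above; the entry for persons a and b is the value 32 obtained earlier.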


Uncorrelated Variables 


In order to show the nature of this problem, let us define, loosely, an 
uncorrelated form as 


TZZ'T’ = a diagonal matrix, 


where Z is a given matrix of data and T is a transformation. To avoid dis- 
cussing at this point certain problems of the form of the data, let us specify 
that Z consists of deviation scores that have been systematically reduced 
so that the variance of each row of Z equals unity. In other words, the data 
are taken in a form such that ZZ’ is the conventional correlation matrix 
with units in the diagonals. Later, questions concerning the form of the 
data will be raised and Z will be shown to be a product of matrices, one of 
which is the matrix of raw scores. Now the definition of an uncorrelated 
form given above does not specify the non-zero elements in the diagonal 
matrix; in other words, it does not specify the weighting to be given each of 
the uncorrelated variables. Two systems of weighting appear to have special 
merit; one is given by 


TZZ'T’ =I 


and is loosely related to the Mahalanobis distance measure for groups. The 
other is given by 


$TZZ'T' = D_\lambda^2,$ 


where $D_\lambda^2$ designates the matrix of non-zero latent (or characteristic) roots of 
the matrix ZZ’; this definition is associated with the Cronbach-Gleser $D^2$ and, 
as they point out, with the Pearson CRL. These two weighting systems give 
different results; one weights the uncorrelated variables, i.e., the factor 
scores, equally; the other weights the factor scores in proportion to the 
size of the square roots of the latent roots. 


The Inverse Transformation 


First consider the transformation that yields equally weighted uncor- 
related variables. It always is possible to resolve Z into a product of principal- 
axis factors and factor scores; thus 


$Z = GD_\lambda P',$ 


where G is a set of orthogonal columns constituting the characteristic vectors 
(in standard form) corresponding to the non-zero latent roots of ZZ’, $D_\lambda$ is the 
matrix of positive square roots of the non-zero latent roots of ZZ’, and P’ is 
the set of factor scores with unit variance. There now are available these 
generalities: $G'G = I_r$, where r is the rank of Z, and $P'P = I_r$. GG’ is a 
pre-multiplication unit for Z and consequently a right and left unit for ZZ’; 
this is true regardless of the rank of Z. Similarly, PP’ is a right and left 
unit for Z’Z, and in fact the columns of P are the characteristic vectors of 
Z'Z corresponding to the non-zero roots of Z’Z, which necessarily are the same 
as the non-zero roots of ZZ’. For a summary, see Harris (4). 
These properties give a solution for T. Set 


$X = D_\lambda^{-1}G'Z = P'.$ 

Then $XX' = I_r$, and the transformation is $T = D_\lambda^{-1}G'$. This principle of 
transformation gives as the uncorrelated data, X, the principal-axis factor 
scores of the persons with unit variance. With Z as defined above, these factor 
scores have means of zero. For the purpose of determining distances between 
pairs of persons this transformation leads to 

$EX'XE' = EPP'E',$ 

that is, the form X’X is simply the form PP’. The effect, then, of this transformation 
is to give distance measures that are functions of the equally-weighted 
principal-axis factor scores. 

It is conventional to ask concerning the solution of any problem in what 
sense, if any, the solution is unique. Consider PP’. If Z’Z is non-singular, 
as it might be, for example, if N, the number of persons, is less than k, the 
number of variables, then PP’ necessarily is simply the identity matrix, I. 
This means then that, using this transformation principle, studying relatively 
few persons with respect to relatively more linearly independent variables 
always yields the same numerical value for the distance between every pair of 
persons, regardless of what set of variables was used. If N is greater than k, 
then Z’Z necessarily is singular and the matrix PP’ is a singular idempotent 
matrix that is a multiplication unit for a group (in the algebraic sense) of 
singular matrices. This also means that there are many sets of data that will 
yield the same matrix, PP’, when the inverse transformation is made and the 
resulting X’X calculated. In other words, under these conditions the distances 
between pairs of persons are not unique to a given set of data. For example, 
if we pre-multiply any given set of data, Z, by a non-singular matrix we 
leave invariant the matrix PP’, but not, of course, P itself. Since distance 
measures computed from data that have been transformed by this inverse 
transformation are functions of PP’, this lack of uniqueness to the given data 
should be recognized. It also should be recognized that these comments assume 
that the inverse transformation is developed from the data in hand rather than 
from data for a different group, such as a normative group. If the latter is 
done, these statements do not hold. 
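
The loss of uniqueness can be illustrated numerically. The sketch below is an assumption-laden example, not the author's computation: it takes N = 3 persons and k = 2 linearly independent variables expressed as deviation scores, so that the rows of Z span every sum-zero vector, and it computes PP' through the identity PP' = Z'(ZZ')⁻¹Z with a hand-written 2 × 2 inverse. The two score matrices and all function names are hypothetical.

```python
from fractions import Fraction
from itertools import combinations

def center_rows(Y):
    """Deviation scores: subtract each variable's (row's) mean."""
    return [[Fraction(v) - Fraction(sum(row), len(row)) for v in row] for row in Y]

def transpose(A):
    return [list(col) for col in zip(*A)]

def matmul(A, B):
    Bt = transpose(B)
    return [[sum(a * b for a, b in zip(row, col)) for col in Bt] for row in A]

def inverse_2x2(M):
    (a, b), (c, d) = M
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def pp_matrix(Z):
    """PP' computed as Z'(ZZ')^{-1}Z; valid here because ZZ' is 2 x 2 and non-singular."""
    Zt = transpose(Z)
    return matmul(matmul(Zt, inverse_2x2(matmul(Z, Zt))), Z)

def all_squared_distances(Y):
    """Squared distance for every pair of persons under the inverse transformation."""
    H = pp_matrix(center_rows(Y))
    return {(i, j): H[i][i] + H[j][j] - 2 * H[i][j]
            for i, j in combinations(range(len(H)), 2)}

# Two unrelated, made-up sets of raw scores (2 variables by 3 persons).
scores_1 = [[1, 2, 6], [4, 0, 5]]
scores_2 = [[7, 3, 3], [0, 9, 1]]
d_1 = all_squared_distances(scores_1)
d_2 = all_squared_distances(scores_2)
```

In both data sets every pair of persons comes out at the same squared distance, which is the behavior the preceding paragraph warns about when few persons are studied on relatively many variables.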

A direct, but quite arduous, calculation procedure would be to factor 
either ZZ’ or Z’Z in order to determine P, the matrix of factor scores. Another 
calculation method results from the identity 

$PP' = Z'(ZZ')^{-1}Z,$ 
provided, of course, that ZZ’ is non-singular. Still another calculation 
procedure is to utilize the principle of Rao’s transformation (6) to develop 
a triangular matrix, C, such that 
CZ = X. 
Then 
$X'X = Z'C'CZ,$ 
where C’C is the inverse of ZZ’, provided it exists. It is interesting to observe 
that this latter method works even though ZZ’ is singular. Adopting a new 
notation, 
$C'C = (ZZ')^{-1} = GD_\lambda^{-2}G',$ 
and, 
$Z'(ZZ')^{-1}Z = PD_\lambda G'GD_\lambda^{-2}G'GD_\lambda P' = PP',$ 
as before. This analysis uses the principle that if ZZ’ is singular, then there 
exists a matrix $(ZZ')^{-1}$, which also is singular, such that 


$ZZ'(ZZ')^{-1} = (ZZ')^{-1}ZZ' = GG',$ 


where GG’ is the symmetric idempotent matrix that is a unit for multiplication 
within the group. The factored form of $(ZZ')^{-1}$ is then seen to be $GD_\lambda^{-2}G'$. 


Principal-Axis Transformation 


The inverse transformation discussed above gives as the uncorrelated 
form of the variables a diagonal matrix whose non-zero entries each equal 
unity. In other words, the inverse transformation gives uncorrelated variables 
of equal (unit) variance. As noted above, a different transformation may be 
defined by requiring that the transformation matrix, T, be such that 

$TZZ'T' = D_\lambda^2,$ 
where, as before, $D_\lambda^2$ is the matrix of non-zero latent roots of ZZ’. This is 
the familiar canonical form of a symmetric matrix; as such, it is a well- 
known definition of an uncorrelated form. It differs from the inverse trans- 
formation in that the transformed variables are now weighted unequally, 
rather than equally, these unequal weights being given by the square roots 
of the roots of the characteristic equation of the symmetric matrix ZZ’. 
If this is chosen as the uncorrelated form, then the transformation is accom- 
plished by setting 
$X = G'Z = D_\lambda P'.$ 
It then follows that $XX' = D_\lambda^2$, as required. In order to determine distances 
between pairs of persons, calculate 
$EX'XE' = EPD_\lambda^2P'E' = EZ'ZE',$ 
since GG’ is a unit for multiplication, as described above, regardless of the 
rank of Z. In other words, choosing the canonical form of ZZ’ as the uncor- 
related form of the variables gives distances between pairs of persons as a 
function of the entries in Z’Z. The calculation procedure obviously requires 
no comment. For distance measures this solution is unique to the given set 
of data; this is related to the fact that the canonical form of a symmetric 
matrix is, under certain rather general conditions, itself unique. 

It is apparent that many different diagonal matrices might be chosen as 
the uncorrelated form of the variables. A choice of a transformation must 
specify the non-zero elements of this diagonal matrix, i.e., it must specify 
the weights to be assigned to the variables in uncorrelated form. Two such 
choices that are meaningfully related to common statistical concepts are the 
identity matrix, I, associated with the inverse transformation, and the 
diagonal $D_\lambda^2$, associated with the canonical form of a symmetric matrix. 
For both these transformations distance measures for pairs of persons are 
functions of factor scores; using the inverse transformation, the factor scores 
are weighted equally, whereas using the principal-axis transformation they 
are weighted unequally. 


The Form of the Data 


Consider now a matrix of data, Y, that consists of the observed measures, 
i.e., the raw scores. In order to transform these data into the form of Z, 
first write 
$YL = [y],$ 


with [y] the matrix of deviation scores. For any Y, which is of order k by N, 
the matrix L which accomplishes the transformation of raw to deviation 
scores is 


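$$
L = \begin{bmatrix}
\frac{N-1}{N} & -\frac{1}{N} & \cdots & -\frac{1}{N} \\
-\frac{1}{N} & \frac{N-1}{N} & \cdots & -\frac{1}{N} \\
\vdots & \vdots & \ddots & \vdots \\
-\frac{1}{N} & -\frac{1}{N} & \cdots & \frac{N-1}{N}
\end{bmatrix}
$$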


The matrix L is square, symmetric, of order N, and of rank (N − 1). Direct 
multiplication verifies that $L = L^2$, i.e., that L is idempotent. The matrix 
L is a quadratic form with roots of unity and is an example of the type of 
matrix referred to in Cochran’s theorem (1). Further, L is a multiplication 
unit for any vector consisting of N terms that sum to zero; the row vector E 
employed earlier is such a vector. Note that what is being done here is to take 
an operation that is ordinarily considered to be an additive one and to write 
it as a multiplicative operation; this becomes a useful tool in the analysis 
of certain relationships among matrices. It is now possible to write 


SYL = Z, 


that is, to pre-multiply the deviation scores by the appropriate diagonal 
matrix to secure the form Z. The diagonal matrix is, of course, one in which 
each of the non-zero elements is given by the reciprocal of the product of 
the square root of N and the standard deviation of the variables. 
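
The multiplicative form of centering can be checked numerically. The following is a minimal sketch in Python, not part of the original paper; the score matrix and the helper names `centering_matrix` and `matmul` are assumptions chosen for illustration. It builds L as the identity minus a matrix whose entries are all 1/N, verifies idempotence exactly, and forms Z = SYL so that ZZ' has a unit diagonal.

```python
from fractions import Fraction
from math import sqrt

def centering_matrix(n):
    """Return L = I - (1/n) * J as exact fractions (J is the all-ones matrix)."""
    return [[Fraction(int(i == j)) - Fraction(1, n) for j in range(n)]
            for i in range(n)]

def matmul(a, b):
    """Plain list-of-lists matrix product."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

# Hypothetical raw scores: k = 2 variables (rows) by N = 4 persons (columns).
Y = [[1, 2, 4, 7],
     [3, 1, 5, 3]]
N = len(Y[0])
L = centering_matrix(N)

idempotent = matmul(L, L) == L            # L times L reproduces L
deviations = matmul(Y, L)                 # YL = [y], the deviation scores
row_sums = [sum(row) for row in deviations]

# Each diagonal element of S is 1 / (sqrt(N) * standard deviation), which
# equals 1 / sqrt(sum of squared deviations) for that variable.
S = [1 / sqrt(sum(float(d) ** 2 for d in row)) for row in deviations]
Z = [[s * float(d) for d in row] for s, row in zip(S, deviations)]
ZZt = matmul(Z, [list(col) for col in zip(*Z)])   # intercorrelations of variables
```

With these illustrative scores every row of YL sums to zero and the diagonal of ZZ' is unity, as the definitions require.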

Using this definition of Z and noting that S is a non-singular matrix, 
that L is idempotent, and that L is a multiplication unit for E, the distance 
between any pair of persons under the inverse transformation becomes 


$$EZ'(ZZ')^{-1}ZE' = EY'(YLY')^{-1}YE'.$$


(A method of calculation when ZZ’ is singular was suggested above.) Reduced 
to these terms, then, this distance measure is a function of the raw scores 
and of the inverse of the matrix of variances and covariances. An analogous 
reduction of the Cronbach-Gleser measure gives 


$$EZ'ZE' = EY'S^{2}YE',$$

showing that it is a function of the raw scores and the reciprocals of the 
variances of the variables, since the scalar 1/N affects all pairs in the same way. 
Clearly, the two measures are identical if, and only if, YLY’ is a diagonal 
matrix of variances. It also is evident that 


$$EY'S^{2}YE' \neq EY'YE',$$


that is, that any change in scale for one or more variables affects the Cron- 
bach-Gleser measure. This is not true for the measure derived from the 
inverse transformation. 
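
The contrast in scale dependence can be illustrated with a small exact computation. This is a sketch under stated assumptions, not the author's calculation: the scores are hypothetical, only two variables are used so that the inverse of YLY' has a closed form, and the function names are invented for the example.

```python
from fractions import Fraction

def cross_products(Y):
    """Return YLY': sums of squares and cross-products of deviation scores."""
    dev = [[x - Fraction(sum(row), len(row)) for x in row] for row in Y]
    return [[sum(a * b for a, b in zip(r1, r2)) for r2 in dev] for r1 in dev]

def inverse_distance(Y, p, q):
    """EY'(YLY')^{-1}YE' for persons p and q (two variables only)."""
    d1, d2 = (row[p] - row[q] for row in Y)        # YE': score differences
    (c11, c12), (_, c22) = cross_products(Y)
    det = c11 * c22 - c12 * c12
    # Closed-form inverse of a symmetric 2 x 2 matrix applied to (d1, d2).
    return (c22 * d1 * d1 - 2 * c12 * d1 * d2 + c11 * d2 * d2) / det

def raw_distance(Y, p, q):
    """EY'YE': squared distance computed on the scores as given."""
    return sum((row[p] - row[q]) ** 2 for row in Y)

# Hypothetical scores: 2 variables (rows) by 4 persons (columns).
Y = [[1, 2, 4, 7],
     [3, 1, 5, 3]]
# The same data with the first variable expressed in units ten times smaller.
Y_rescaled = [[10 * x for x in Y[0]], Y[1]]
```

For the first two persons the inverse-transformation measure is 27/38 under either scaling, whereas the unstandardized squared distance changes from 5 to 104.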

Other modifications of the form of the data might be explored using these 
techniques. One such modification that is of interest is secured by centering 
Y by columns rather than by rows; we may write MY to designate this, 
where M is of the form of L with k substituted for N. The matrix MY is such 
that each column sums to zero, i.e., the “elevation” has been equalized for 
all persons. Then by choosing the appropriate diagonal matrix, $S_y$, the 
product $S_yY'MYS_y$ yields the intercorrelations of the persons. If the 
principal-axis factors of this matrix are taken as the descriptions of persons in 
terms of uncorrelated variables, then the distance between any pair of persons 
is simply $ES_yY'MYS_yE'$. Obviously, 


$$ES_yY'MYS_yE' \neq EY'MYE',$$


which merely states that any change in scale for the columns affects this type 
of measure. 

Finally, MYL is a double-centered matrix, i.e., it sums to zero both by 
rows and by columns. Since matrix algebra is a linear associative algebra, it 
makes no difference whether one first forms MY and then centers by rows, or 
first forms YL and then centers by columns; the resulting MYL is the same in 
either case. Now, since L is a multiplication unit for any E, it can be seen that 


$$EY'MYE' = ELY'MYLE'.$$


This identity emphasizes the point that if the data are centered by columns 
then double-centering does not alter this particular distance measure. 
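
The order-independence of double-centering can be confirmed directly. The sketch below is illustrative only; the scores and the function names `center_rows` and `center_columns` are assumptions, and exact fractions are used so that the two results can be compared for strict equality.

```python
from fractions import Fraction

def center_rows(A):
    """Subtract each row's mean from its entries (post-multiplication by L)."""
    return [[x - Fraction(sum(row), len(row)) for x in row] for row in A]

def center_columns(A):
    """Subtract each column's mean from its entries (pre-multiplication by M)."""
    transposed = [list(col) for col in zip(*A)]
    return [list(row) for row in zip(*center_rows(transposed))]

# Hypothetical scores: 3 variables (rows) by 4 persons (columns).
Y = [[1, 2, 4, 7],
     [3, 1, 5, 3],
     [2, 2, 6, 1]]

rows_then_columns = center_columns(center_rows(Y))   # M(YL)
columns_then_rows = center_rows(center_columns(Y))   # (MY)L
```

Both orders give the same matrix, and every row and every column of it sums to zero.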

Table 1 gives the algebra of distance measures developed on the basis 
of the two principles discussed above for various forms of the data. It is 
evident that not all the distance measures listed there would be judged 
to be meaningful ones; it also is evident that some of them are duplicates. 
The table is probably of primary value in demonstrating that this system of 
analysis makes explicit a range of choices of distance measures and provides 
a method of specifying the choice that a worker may make. Table 2 gives 
illustrative scores on three tests for five students. These scores have been 
used to compute the illustrative distance measures between pairs of students 
that are given in Table 3. 
TABLE 1
Distance Measures for Various Forms of the Data

Form of Data   Principal-axis Transformation   Inverse Transformation
Y              EY'YE'                          [illegible]
YL             EY'YE'                          EY'(YLY')^{-1}YE'
SYL            EY'S^{2}YE'                     EY'(YLY')^{-1}YE'
MY             EY'MYE'                         [illegible]
MYS_y          [illegible]                     [illegible]
MYL            EY'MYE'                         EY'M(MYLY'M)^{-1}MYE'

TABLE 2
Illustrative Scores on Three Tests for Five Students
(Tests: 1. Spelling; 2. Usage; 3. Vocabulary. Students: a to e.)
[The numeric entries of this table are interleaved with those of Table 3 in the source and cannot be recovered.]

TABLE 3
Illustrative Distance Measures between Pairs of Students
[The numeric entries of this table are interleaved with those of Table 2 in the source and cannot be recovered.]

A new algebraic formula is derived for estimation of the discriminal 
dispersion in the method of successive intervals. The legitimate use of the 
formula requires that as many normal deviates as possible be present in the 
matrix. For this reason, it is recommended that deviates corresponding to the 
interval (0.01, 0.99) of the cumulative proportions be used, instead of those 
corresponding to (0.05, 0.95), the interval used by Edwards and Thurstone. 
Computations on data published by Edwards and Thurstone showed that 
when adjustment was made for variability in dispersions calculated by the 
formula of this paper, a reduction of fifty per cent in mean absolute discrep- 
ancy was produced. Since the formula is easy to use and avoids the disadvan- 
tages of its predecessors, it should have fairly wide applicability in psycho- 
logical research. 


The method of successive intervals is perhaps the most practical way 
of obtaining rational scale values of stimuli along a unidimensional psycho- 
logical continuum not simply correlated with any physical variable. The 
data may be provided by any procedure in which judges classify stimuli into 
a finite number of mutually exclusive and exhaustive classes which are ordered 
along some dimension. 

When the number of stimuli is small, they may be ranked without 
ties, so that the number of classes equals the number of stimuli. When the 
number of stimuli is large, they may be either sorted into piles or rated on a 
rating scale. With either of these procedures, the number of classes may be 
considerably less than the number of stimuli. For adequate reliability, a 
large sample of judges is needed when any of these techniques of gathering 
data is used. 

Although successive intervals was developed by L. L. Thurstone, its 
first published account was given in a paper by Saffir (8) in 1937. Recently, 
papers by Edwards (4) and Edwards and Thurstone (5) have presented a 
check on internal consistency which indirectly tests the applicability of the 
postulates to any particular set of data. This check now makes successive 
intervals a serious rival to the method of paired comparisons. The advantage 
of successive intervals over paired comparisons lies in its greater speed in 
collecting data. Empirical studies (4, 5) have shown that there is a linear 
relation between scale values obtained by these two methods. 

*This research was supported in part by the United States Air Force under Contract 
No. AF 33(038)-25726 monitored by Air Force Personnel and Training Research Center. 
Permission is granted for reproduction, translation, publication, use and disposal in whole 
and in part by or for the United States Government. The writer is grateful to Dr. A. L. 
Edwards for a critical reading of an earlier version of this paper, and to Dr. L. H. Lanier 
and Dr. L. M. Stolurow for editorial advice on the present version, which was written at 
the University of Illinois. The editors of Psychometrika have informed the writer that 
H. J. A. Rimoldi and M. Hormaeche (7) have independently derived the same formula for 
the discriminal dispersion from a different set of postulates using the law of comparative 
judgment. 

In any stimulus scaling method developed in the Thurstone manner, 
there are at least two important kinds of parameters, represented respectively 
by $\bar{S}_j$, the scale value of the jth stimulus, and $\sigma_j$, the corresponding 
discriminal dispersion. Although adequate computational techniques for 
estimating each $\bar{S}_j$ by the method of successive intervals have been 
published (4, 5), those available for estimating $\sigma_j$ are subject to improvement. 

The first technique, developed by Thurstone and presented by Saffir 
(8), does not base the computation of each $\sigma_j$ on all of the data. Also, it does 
not use a simple algebraic formula in the manner originated by Thurstone 
(9, 10) and further applied by Burros (2) and Burros and Gibson (3) for 
estimation of $\sigma_j$ in the method of paired comparisons. It is interesting to 
note, therefore, that in a recent paper on successive intervals, Edwards and 
Thurstone (5) did not use the technique presented by Saffir for estimating 
the dispersions. Instead these writers used one published by Attneave (1). 
A critical examination of Attneave’s technique will be made later on in this 
paper. In the writer’s opinion, it does not have a rigorous basis. 

Perhaps the most rigorous approach to the problem is a least squares 
solution recently published by Gulliksen (6). Unfortunately, it is possible 
(although admittedly improbable) that negative estimates of the dispersions 
may be calculated by this technique. This sort of result could happen if the 
dispersions are exceedingly variable. A small positive dispersion could then 
be estimated as negative when his least squares solution is applied to the 
data. A related discussion of this problem of absurd results in paired com- 
parisons is presented by Burros and Gibson (3, pp. 63-64). 

Since the techniques published by Saffir (8) and Attneave (1) are ques- 
tionable, and the one by Gulliksen (6) conceivably may give absurd results, 
a new formula may be of interest. This paper, therefore, presents the derivation 
of a simple formula for the estimation of $\sigma_j$ in the method of successive 
intervals, which is similar to those previously derived for paired comparisons 
(10, 2, 3). The use of the formula will then be illustrated by means of further 
analysis of data presented by Edwards and Thurstone (5). 


Definition of Symbols 


R = postulated unidimensional psychological continuum with finite range 
arbitrarily divided into N class intervals corresponding to the steps 
on an N-point rating scale; 
S = postulated unidimensional psychological continuum with unrestricted 
range corresponding to R; 
$R_k$ = upper true limit of kth class interval of R; 
$S_k$ = corresponding upper true limit of kth class interval of S; 
$R_j$ = momentary estimate by a judge of the scale position of the jth 
object on R; 
$S_j$ = corresponding momentary estimate of the scale position of the jth 
object on S; 
$\bar{S}_j$ = scale position (mean and median of $S_j$) of jth object on S; 
$\sigma_j$ = discriminal dispersion or standard deviation of distribution of $S_j$; 
$X_{jk} = (S_k - \bar{S}_j)/\sigma_j$; 
$z_j = (S_j - \bar{S}_j)/\sigma_j$; 
$P_{jk}$ = probability that $R_j < R_k$. 


Postulates 


1. There exists a unidimensional psychological continuum (R) with 
a finite range, arbitrary units, and an arbitrary origin. 

2. There exists a corresponding unidimensional psychological continuum 
(S) with an unrestricted range, equal units, and an arbitrary origin. 

3. S = f(R), where the function is monotonic, increasing, and generally 
nonlinear. 

4. For object j, and corresponding to each observed momentary estimate 
($R_j$) by a judge on R, there exists a theoretical momentary estimate ($S_j$) 
on S. The distribution of $S_j$ is normal with mean (and thus median) of $\bar{S}_j$ 
and standard deviation $\sigma_j$. 


Basic Theorem 
Since f(R) is monotonic increasing, 

$$P_{jk} = P(R_j < R_k) = P(S_j < S_k).$$

But if $S_j < S_k$, then 

$$(S_j - \bar{S}_j)/\sigma_j < (S_k - \bar{S}_j)/\sigma_j,$$

so that 

$$z_j < X_{jk}$$

by definition of these quantities. Therefore, 

$$P_{jk} = P(z_j < X_{jk}) = G(X_{jk}),$$

where G is the normal probability integral. Given each estimated value of 
$P_{jk}$ determined by the empirical frequency distribution of $R_j$ on R, therefore, 
the corresponding estimated value of $X_{jk}$ can be found from a table of the 
normal integral. These may be arranged in a matrix X. 
Whenever $P_{jk}$ equals 0 or 1, the value of $X_{jk}$ is indeterminate. If any 
proportion is too near to either 0 or 1 (say, less than 0.01, or greater than 
0.99) the values of $X_{jk}$ are too unreliable to be recorded. Whenever a value 
of $X_{jk}$ is indeterminate or unreliable, it is omitted from the X matrix. 
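
The construction of the X matrix can be sketched as follows. This Python fragment is an illustration rather than the writer's procedure: it replaces the table of the normal integral by `statistics.NormalDist().inv_cdf`, the proportions are hypothetical, and the function name `deviate_matrix` is invented for the example. Omitted cells are represented by `None`.

```python
from statistics import NormalDist

def deviate_matrix(P, lower=0.01, upper=0.99):
    """Convert cumulative proportions to unit-normal deviates.

    Proportions below `lower` or above `upper` (including 0 and 1) are
    treated as indeterminate or unreliable and recorded as None.
    """
    inverse = NormalDist().inv_cdf
    return [[inverse(p) if lower <= p <= upper else None for p in row]
            for row in P]

# Hypothetical cumulative proportions: 2 stimuli (rows) by 5 class limits.
P = [[0.005, 0.10, 0.50, 0.90, 1.00],
     [0.02, 0.25, 0.75, 0.995, 1.00]]
X = deviate_matrix(P)
```

Narrowing the bounds to 0.05 and 0.95 would reproduce the more restrictive interval used by Edwards and Thurstone and would leave more cells empty.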


Derivation of Formula for $\sigma_j$


From the definition of $X_{jk}$, it follows that 

$$S_k = \bar{S}_j + \sigma_j X_{jk}. \tag{1}$$

Similarly for $X_{ik}$, 

$$S_k = \bar{S}_i + \sigma_i X_{ik}. \tag{2}$$

Therefore, 

$$\bar{S}_j + \sigma_j X_{jk} = \bar{S}_i + \sigma_i X_{ik} \tag{3}$$

and 

$$X_{jk} = (\bar{S}_i - \bar{S}_j)/\sigma_j + (\sigma_i/\sigma_j)X_{ik}. \tag{4}$$

Equation (4) says that the jth row in the X matrix is theoretically a linear 
function of the ith row with slope of 

$$m = \sigma_i/\sigma_j. \tag{5}$$

Theoretically these two rows are perfectly correlated. 

Let $V_i$ and $V_j$ be the respective measures of variability, e.g., standard 
deviations or ranges, of the ith and jth rows of X. Assuming perfect 
correlation, therefore, the slope of (4) is also equal to 

$$m = V_j/V_i. \tag{6}$$

Therefore, 

$$\sigma_i/\sigma_j = V_j/V_i \tag{7}$$

and 

$$\sigma_i V_i = \sigma_j V_j. \tag{8}$$

Thus, for any two stimulus objects i and j either side of (8) theoretically 
equals a constant, defined as 

$$a = \sigma_j V_j. \tag{9}$$

Therefore, 

$$\sigma_j = a/V_j. \tag{10}$$

In order to estimate a, a unit of $\sigma_j$ must be chosen. This is an arbitrary matter. 
The simplest definition is that the unit is the mean of the sigmas of the n 
stimuli, i.e., 

$$\frac{1}{n}\sum_j \sigma_j = 1 \tag{11}$$

and 

$$\sum_j \sigma_j = n. \tag{12}$$

Summing (10), and using (12), 

$$n = \sum_j \sigma_j = a\sum_j (1/V_j). \tag{13}$$

Therefore, 

$$a = n \Big/ \sum_j (1/V_j). \tag{14}$$

Thus, a is the harmonic mean of the values of $V_j$, which are obtained 
empirically from the rows of the X matrix. After a is estimated from (14), each 
estimated value of $\sigma_j$ is given by (10). 
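
Equations (10) and (14) translate into a few lines of arithmetic. The sketch below is illustrative; the function name is an assumption, and the input values are the row variabilities as they are printed in this copy of Table 1 of the paper, so small departures from the published figures may reflect rounding or transcription.

```python
from statistics import harmonic_mean

def discriminal_dispersions(V):
    """Return (a, sigma) where a = n / sum(1/V_j) and sigma_j = a / V_j."""
    a = len(V) / sum(1 / v for v in V)
    return a, [a / v for v in V]

# Row variabilities V_j for the ten stimuli, as printed in Table 1.
V = [1.08, 1.26, 1.16, 1.25, 1.24, 1.52, 1.34, 1.48, 0.998, 0.950]
a, sigma = discriminal_dispersions(V)

# a coincides with the harmonic mean of the V_j, and the resulting
# dispersions sum to n, i.e., their mean is the chosen unit.
same_as_harmonic_mean = abs(a - harmonic_mean(V)) < 1e-9
```

By construction the estimated dispersions average exactly to unity, so any departure of a tabled sum from n can arise only from rounding.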

Now that the new formula has been derived, it is possible to criticize 
Attneave’s technique. According to Attneave, this “assumes that the mean 
dispersion of stimuli represented in one dichotomy is equal to the mean 
dispersion of those represented in another; this assumption may be only 
approximately correct” (1, p. 340). A sufficient condition for this assumption 
is that all entries in the X matrix are present. When this is so, it follows from 
Attneave’s directions that the discriminal dispersion of any stimulus equals 
the ratio of the arithmetic mean of the ranges of stimulus X values to the 
range of the given stimulus. Equations (10) and (14) show, however, that 
the proper average for the numerator of the ratio is the harmonic mean, 
not the arithmetic mean. Even when Attneave’s assumption is known to be 
true, therefore, his technique is not strictly correct. It may sometimes give 
adequate results, however, if the arithmetic mean range and the corresponding 
harmonic mean are approximately equal. 
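
The effect of substituting the arithmetic for the harmonic mean can be seen in a small numerical sketch. The variabilities below are hypothetical, and the code represents only the ratio described in the preceding paragraph, not Attneave's full procedure.

```python
from statistics import fmean, harmonic_mean

# Hypothetical row variabilities for three stimuli.
V = [0.8, 1.0, 1.6]
n = len(V)

# Numerator taken as the harmonic mean, as equations (10) and (14) require.
sigma_harmonic = [harmonic_mean(V) / v for v in V]
# Numerator taken as the arithmetic mean instead.
sigma_arithmetic = [fmean(V) / v for v in V]

# Every arithmetic-mean estimate is inflated by the same factor.
inflation = fmean(V) / harmonic_mean(V)
```

The two sets of estimates are proportional, so the relative sizes of the dispersions are unaffected; only the unit is disturbed, since the arithmetic-mean estimates no longer average to unity unless all the $V_j$ are equal.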


Application of Formula 


To save space, tables presented by Edwards and Thurstone (5) will 
not be reproduced. They provide the following relevant data: (a) their 
Table 1 (5, p. 172) gives the cumulative proportions $P_{jk}$ for ten stimuli 
rated on a nine-point scale; (b) their Table 2 (5, p. 173) gives the normal 
deviates $X_{jk}$ corresponding to the proportions in the closed interval (0.05, 
0.95). In order to reduce the number of empty cells, the writer entered into 
a copy of this table those additional deviates required to encompass the 
interval (0.01, 0.99) of the proportions. 
Table 1 of this paper shows the results of the computations. $V_j$ is the 
standard deviation of the reliable normal deviates for stimulus j based upon 
$N_j$ values. The parameter a is then computed by (14) to be 


$$a = n \Big/ \sum_j (1/V_j) = 10/8.332 = 1.20.$$


Then each value of $\sigma_j$ is computed in Table 1 as $a(1/V_j)$. 

TABLE 1


Calculation of the Discriminal Dispersion ($\sigma_j$) from Data 
Presented by Edwards and Thurstone (5) 


 j*     N_j**    V_j      1/V_j    $\sigma_j$
 1      6        1.08     0.926    1.11
 2      7        1.26     0.794    0.95
 3      6        1.16     0.862    1.03
 4      8        1.25     0.800    0.96
 5      8        1.24     0.806    0.97
 6      6        1.52     0.658    0.79
 7      8        1.34     0.746    0.90
 8      7        1.48     0.676    0.81
 9      8        0.998    1.012    1.21
 10     8        0.950    1.052    1.26

 Sum                      8.332    9.99***


*As rank order (j) of the stimulus increases, the scale value tends to decrease. 
**$N_j$ is the number of normal deviates corresponding to proportions in the interval 
(0.01, 0.99). 
***Presumably errors from rounding off decimals account for the departure of this 
sum from the theoretical value (10.00). 


Discussion 


The estimates of the dispersions calculated for successive intervals 
by the formula of this paper correspond roughly to the estimates of the 
same parameters of the same stimuli computed by Edwards and Thurstone 
(5, p. 177) by means of the method of paired comparisons. They reported 
that the latter dispersions ‘‘showed considerable variation, ranging from a 
low of .52 for stimulus 6 to a high of 1.32 for stimulus 10.” (5, p. 177). The 
corresponding successive intervals dispersions in Table 1 of this paper are 
0.79 and 1.26, respectively. 

Although Edwards and Thurstone (5, p. 177) reported comparable variation 
in $\sigma_j$ computed from their successive intervals data by Attneave’s 
technique (1), they noted the surprising fact that adjustment for variability 
of dispersions did not improve the goodness of fit measured by the mean 
absolute discrepancy of the proportions. Since this does not conform to usual 
experience with paired comparisons, some additional computations made 
by the writer may be of interest. 

First, using the normal deviates corresponding to proportions in the 
interval (0.01, 0.99) and the dispersions previously computed by the new 
formula, the successive intervals scale values of these stimuli were com- 
puted by an algebraic technique. Since there was close correspondence 
with the scale values reported by Edwards and Thurstone, who used the 
interval (0.05, 0.95), no details about these computations need be given here. 

Then the mean absolute discrepancy was computed to be 0.0135. This 
is half of the value, 0.027, reported by Edwards and Thurstone when adjust- 
ment was made in successive intervals for variability in dispersions calculated 
by Attneave’s technique. It is concluded that the mean absolute discrepancy 
may be considerably less than that reported by Edwards and Thurstone 
when correction is made for variability in dispersions calculated by the formula 
of this paper. 

In order to fulfill the requirements of the formula, however, the number 
of empty positions in the X matrix should be reduced. The use of a wider 
interval of acceptably reliable proportions, i.e., (0.01, 0.99) instead of (0.05, 
0.95) will produce this desired result. The use of this wider interval is, there- 
fore, recommended. 

Since the formula presented here avoids the disadvantages of its 
predecessors but is easy to use, it should have fairly wide applicability in 
psychological research. 
The law of comparative judgment is applied to the successive intervals 
and graphic rating scale methods. A procedure for estimating the modal 
discriminal process and discriminal dispersion of the stimuli, as well as the 
value of the boundaries of the intervals on the continuum, is given. From the 
estimated values it is possible to determine the theoretical proportions and to 
compare them with the actual experimental proportions. The agreement be- 
tween these values is an indication of the adequacy of the assumptions made. 


The rationale and the system of computations described in the present 
paper developed from a suggestion offered by L. L. Thurstone in one of his 
courses at the University of Chicago. He suggested an interpretation of the 
method of successive intervals based on the assumption that, in the process 
of indicating preferences, a subject will compare the affective value of each 
stimulus with the affective value represented by the interval limits on the 
psychological continuum. 

In the present study stimuli were presented using three different pro- 
cedures: 


1. Method of successive intervals. The subject was presented with equally 
spaced intervals having reference to degree of interest in an indicated stimulus. 
His placement of a check mark in any interval was interpreted as indicating 
that his interest in that stimulus was greater than that represented by the 
lower limit of the interval and smaller than the interest represented by its 
upper limit. A pre-test on approximately 30 subjects demonstrated experi- 
mentally that the continuum could be defined unambiguously. 


*This article is the first part of a larger study conducted at the Laboratorio de Psico- 
logia, Facultad de Humanidades y Ciencias, Montevideo, Uruguay, during the years 1951 
and 1952. The authors want to thank Dr. L. V. Jones for his critical comments on the 
manuscript. The authors have been informed by the editors of Psychometrika that R. H. 
Burros (2) has independently reached the same analytic solution for the computation of 
stimulus dispersions. Dr. Burros has used a set of assumptions different from the ones stated 
in the present paper. 

†Now at Loyola University, Chicago, Ill. 
2. Multiple category method. This is a variation of the previous procedure. 
Subjects were instructed to encircle the word or sign (Yes, yes, ?, no, No) 
that best represented their interest in the stimulus. 


3. Graphic rating scale. Here the subject was asked to state his interest 
in each stimulus by placing a check mark on a straight line without intervals. 
The location of the check mark on the continuum was interpreted as indi- 
cating that the interest of the subject in the stimulus was greater than that 
represented by the points on the continuum located to the left of the check 
mark, and smaller than that represented by the points on the continuum 
located to the right of the check mark. 

In all the presentations it was assumed that the subject compared the 
value of the stimulus with the value represented by the different points on 
the continuum. 

Determination of $L_i$

Let $S_j$ (j = 1, 2, ..., j, ..., n) represent the modal discriminal process 
for the jth stimulus, and $\sigma_j$ (j = 1, 2, ..., j, ..., n) the discriminal dispersion 
for the jth stimulus. $L_i$ (i = 1, 2, ..., i, ..., m - 1) is the modal discriminal 
process of the boundary between intervals i and i + 1, where there are m 
successive intervals, and $d_i$ (i = 1, 2, ..., i, ..., m - 1) represents the 
discriminal dispersion of the ith boundary. $d_i$ is generated in a manner 
similar to that described by Thurstone (7) for the discriminal dispersion of 
the stimuli. 

It will be assumed that the stimuli are normally distributed and that, 
together with the interval limits, they can be located on the same psycho- 
logical continuum. Throughout the study it will be assumed that we are 
dealing with Thurstone’s case II (8), where several individuals made one 
judgment each. 

The origin of the scale will be defined as 

$$\sum_{j=1}^{n} S_j = 0, \tag{1}$$

and the unit of measurement will be 

$$\frac{1}{n}\sum_{j=1}^{n} \sigma_j = 1. \tag{2}$$

According to the law of comparative judgment (8) and to the previous 
assumptions it is possible to write 

$$L_i - S_j = X_{ij}\sqrt{\sigma_j^{2} + d_i^{2} - 2r_{ij}\sigma_j d_i}, \tag{3}$$

where $X_{ij}$ is the normal deviate corresponding to the proportion of times 
that stimulus j has been placed in a position less preferred than the point $L_i$, 
and $r_{ij}$ is the correlation between the subject’s judgment of stimulus j and the 
interval boundary i. 

It seems defensible to assume that $d_i$ will be very small when compared 
with $\sigma_j$ and that the more precise the definition of the continuum the smaller 
the value of $d_i$. This assumption seems to be corroborated by an unpublished 
investigation by L. V. Jones at the University of Chicago. The empirical 
evaluation of $d_i$, in terms of the unit of measurement of (2), demonstrates 
the magnitude to be no greater than .05 and generally much smaller. Ignoring 
the value of $d_i$, (3) becomes 

$$L_i - S_j = X_{ij}\sigma_j. \tag{4}$$

Adding and averaging (4) for all stimuli and keeping $L_i$ constant we 
have, using (1), 

$$\Big(\sum_j X_{ij}\sigma_j\Big)\Big/n = L_i. \tag{5}$$

Determination of the Modal Discriminal Process for Each Stimulus 


According to (4) it is possible to have as many $S_j$ values as there are 
$L_i$ points on the continuum. Keeping $S_j$ constant and adding and averaging 
for all the $L_i$ values, we obtain 

$$\Big(\sum_i L_i - \sigma_j\sum_i X_{ij}\Big)\Big/(m - 1) = S_j. \tag{6}$$

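
The two averaging steps, over stimuli for each boundary and over boundaries for each stimulus, can be sketched in Python. This is an illustration under stated assumptions: the function name and the "true" values are hypothetical, chosen so that the scale values sum to zero and the dispersions average to unity, and the deviates are generated from the simplified relation in which the boundary dispersions are ignored.

```python
def boundary_and_scale_values(X, sigma):
    """Estimate boundaries L_i and scale values S_j from deviates X[i][j].

    L_i averages X_ij * sigma_j over the n stimuli; S_j averages
    L_i - X_ij * sigma_j over the m - 1 boundaries.
    """
    n = len(sigma)
    boundaries = len(X)                      # m - 1 interval boundaries
    L = [sum(x * s for x, s in zip(row, sigma)) / n for row in X]
    S = [sum(L[i] - X[i][j] * sigma[j] for i in range(boundaries)) / boundaries
         for j in range(n)]
    return L, S

# Hypothetical true values: sum(S_true) = 0 and mean(sigma_true) = 1.
L_true = [-1.5, -0.5, 0.5, 1.5]
S_true = [-1.0, 0.25, 0.75]
sigma_true = [0.8, 1.0, 1.2]

# Error-free deviates X_ij = (L_i - S_j) / sigma_j.
X = [[(l - s) / sg for s, sg in zip(S_true, sigma_true)] for l in L_true]
L_hat, S_hat = boundary_and_scale_values(X, sigma_true)
```

With error-free deviates the procedure returns the generating values, which is the behavior the origin and unit conventions are intended to guarantee.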
Determination of the Discriminal Dispersions 
Writing (4) for stimuli 1 and 2, we have 

$$\begin{aligned} L_i - S_1 &= X_{i1}\sigma_1, \\ L_i - S_2 &= X_{i2}\sigma_2. \end{aligned} \tag{7}$$

Equation (7) is similar to the basic equation used in the scaling of mental 
tests (6) and, after subtraction, may be written 

$$X_{i1} = X_{i2}(\sigma_2/\sigma_1) + (S_2 - S_1)/\sigma_1, \quad \text{where } (S_2 - S_1)/\sigma_1 = K = \text{constant}.$$

Thus, 

$$X_{i1} = X_{i2}(\sigma_2/\sigma_1) + K. \tag{8}$$

There are as many $X_{i1}, X_{i2}, \ldots, X_{ij}, \ldots, X_{in}$ values as there are $L_i$ 
points on the continuum. From these values n standard deviations, $V_j$, 
may be computed by 

$$V_j = \frac{1}{m-1}\sqrt{(m-1)\sum_i X_{ij}^{2} - \Big(\sum_i X_{ij}\Big)^{2}}.$$

Performing the necessary operations, (8) can be written as 

$$V_1 = V_2(\sigma_2/\sigma_1),$$

and consequently 

$$\sigma_1 = c/V_1, \quad \text{where } \sigma_2 V_2 = c = \text{constant}. \tag{9}$$

Changing subscripts of $\sigma$ and V from 1 to j and adding all the resulting 
equations, 

$$n = c\sum_j (1/V_j), \quad \text{since } \sum_j \sigma_j = n,$$

and accordingly 

$$c = n \Big/ \sum_j (1/V_j). \tag{10}$$

From (9) and (10) it is readily seen that the value of the discriminal dispersion 
for any stimulus j is given by 

$$\sigma_j = n \Big/ \Big[V_j \sum_j (1/V_j)\Big]. \tag{11}$$

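
Formula (11) can be tried on a fictitious example with known dispersions. The sketch below is illustrative only; the generating values and the function name are assumptions, and the column standard deviation is taken with `statistics.pstdev`. Any constant multiplier of the $V_j$ cancels in (11), so the choice of divisor in the standard deviation does not affect the result.

```python
from statistics import pstdev

def dispersions_from_deviates(X):
    """Estimate sigma_j = n / (V_j * sum(1/V_j)) from deviates X[i][j].

    V_j is the standard deviation of the jth column of X, i.e., of the
    deviates of stimulus j across the interval boundaries.
    """
    n = len(X[0])
    V = [pstdev([row[j] for row in X]) for j in range(n)]
    total = sum(1 / v for v in V)
    return [n / (v * total) for v in V]

# Hypothetical generating values with mean(sigma_true) = 1.
L_true = [-1.5, -0.5, 0.5, 1.5]
S_true = [-1.0, 0.25, 0.75]
sigma_true = [0.8, 1.0, 1.2]
X = [[(l - s) / sg for s, sg in zip(S_true, sigma_true)] for l in L_true]

sigma_hat = dispersions_from_deviates(X)
```

With error-free deviates the known dispersions are recovered, and the estimates always sum to n whatever the data.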
If the values of the dispersions are equal for all the stimuli, then, plotting 
(8), the slope of the line will be unity. If it can be safely assumed that the 
discriminal dispersions for all the stimuli are equal, then according to (2) 
their value should be unity. Consequently (5) and (6) can be written as 

$$L_i = \sum_j X_{ij}\Big/n, \tag{12}$$

and 

$$S_j = \sum_i (L_i - X_{ij})\Big/(m - 1). \tag{13}$$



Reproduction of the Original Experimental Proportions 
From (4) 

$$X_{ij} = (L_i - S_j)/\sigma_j. \tag{14}$$

If the discriminal dispersions are assumed to be equal, then 

$$X_{ij} = L_i - S_j. \tag{15}$$

The $X_{ij}$ values thus obtained should be transformed into proportions and 
these compared with the original experimental values. 

Equations (1) to (15) refer to true values. For the purposes of computa- 
tion a parallel set of equations can be written using sample values instead of 
true values. 
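
The reproduction of proportions and the comparison with the experimental values can be sketched as follows. The code is an illustration, not the authors' computing routine: the boundary values, scale value, dispersion and "actual" proportions are hypothetical, and the normal table is replaced by `statistics.NormalDist().cdf`.

```python
from statistics import NormalDist

def theoretical_proportions(L, S_j, sigma_j):
    """Interval proportions for one stimulus from X_ij = (L_i - S_j) / sigma_j."""
    cdf = NormalDist().cdf
    cumulative = [cdf((l - S_j) / sigma_j) for l in L] + [1.0]
    lower = [0.0] + cumulative[:-1]
    # Proportion in each interval is the difference of successive cumulatives.
    return [hi - lo for lo, hi in zip(lower, cumulative)]

def average_discrepancy(actual, theoretical):
    """Mean absolute difference between actual and theoretical proportions."""
    return sum(abs(a - t) for a, t in zip(actual, theoretical)) / len(actual)

# Hypothetical values: 4 boundaries define 5 successive intervals.
L = [-1.5, -0.5, 0.5, 1.5]
theory = theoretical_proportions(L, 0.0, 1.0)
actual = [0.07, 0.24, 0.38, 0.25, 0.06]
discrepancy = average_discrepancy(actual, theory)
```

A small average discrepancy indicates that the assumptions of the model are adequate for the data at hand.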

Experimental Results 


Subjects were asked to state their degree of “interest in knowing” 
certain people. The stimuli were names of men and women who were well 
known to the experimental group. (See Tables 2, 3, 4, and 5). 

TABLE 1


Stimuli    Known Values of $\sigma_j$    Values of $\sigma_j$ from Formula (11)

[Table entries are not legible in the source.]

TABLE 2 


Method of Successive Intervals 


(Decimal points omitted.) 


2 2 3 4 5 6 T 8 9 

A* 000 006 000) Obl 065 176 206 318 188 

Roosevelt T 005 O07 O15 027 «086 162 186 294 218 
A 024 035 076 206 159 259 224 
Leonardo T 009 O11 020 034 096 163 17%  e7h 218 
Hitler T 070 190 
A 006 171 265 276 076 065 
Garibaldi T O34 O77 112 276 211 143 090 002 
A 018 065 100 065 194 176 
Marie Antoinette 49 038 056 075 169 209 165 417% O7% 
A 047, 094 O94 088 206 159 118 159 035 

Mussolini = 070 «#2055 
A 012 012° 012 053 088 235 182 229 176 
T 010 013 025 039 24111 16% 185 264 169 
Shale A 006 O12 018 006 055 129 212 288 276 
T 007 009 «025 «129 «#2155 
MA A 012 018 O47 065 200 253 194 188 02% 
San Martin Ol. O42 O71 196 273 196 161 029 
A 012 076 094 059 «#41159 176 153 12% 
A O41 065 082 312 188 141 obL 035 
Teabella T 054 055 083 110 222 228 135 095 018 
A 0592s «035 070 065 176 188 153 159 
T 058 059 075 162 189 «#172 092 
A 035 024 029 O41 129 «#4159 «#4182 276 12h 
A 000 018 02h o2h 065 ikl 159 300 270 
T 009 O10 019 028 079 «+1139 «#4156 269 
A 000 065 176 312 22k 188 006 
T 005 O13 027 169 272 22h 197 039 
A 006 012 o12 O47 076 106 153 294 294 
mapenece T 009 O10 O18 033 072 132 155 262 309 
A 018 018 ob7 029 «09h 182 159 229 224 
T 020 
A OI 053 165 194 288 224 
T 010 O11 «4022 032 
A 018 018 ob87 188 282 188 070 

T O17 023 O82 O67 17% 237 192 191 057 
A 053 O41 218 141 135 224 106 
= T O48 034 052 066 152 188 155 192 113 

A 02h 012 O82 170 170 188 212 

T 022 025 060 153 21h 182 208 
A 006 035 100 170 188 335 147 
Curie T 006 O10 018 033 099 «+178 196 261 179 
A 006 000 000 (018 86176) 6412-153 
T 002 206 

A 053 065 076 188 200 165 159 053 

Sarmiento T 051 06k (OST 
nertvar A 006 012 035 106 229 224 165 170 053 


ancy (per column) 


Standard Deviation = .025 


Average Discrepancy (total) = .017 


*A = Actual proportions; T = Theoretical proportions. 
TABLE 3 
Graphic Rating Scale 
(Decimal points omitted.) 


1 2 3 4 5. 6 7 8 9 


A* 006 006 012 018 147 229 2h. 129 212 

Roosevelt T 005 008 013 02% 168 195 197 187 203 

Leonardo T 008 O11 O17 42030 «189 193 192 176 18h 

T 070 4036 056 5h 

T 03% 093 «383 

A 070 053 070 065 300 165 112 094 070 

A 18 088 053 112 270 129 «4112 «076 

Mussolini T 129 063 088 101 047 

A 02h o18 O41 265 196 165 129 

Joan of Arc T 018 020 028 O45 237 204 177 127 

A 000 o12 026 o12 14. 218 196 176 223 

Shakespeare T 002 003 007 O15 147 20h 224 211 167 

‘ A o12 012 O41 094 294 212 194 082 059 

T O14 019 O31 057 308 245 172 104 050 

‘ A 035 053 082 053 259 200 129 094 094 
Becquer T 052 037 olg 274 190 143 103 

A 053 «065 

A 070 ©4082 «849053 129135. 

Cleopatra - 092 050 061 075 274 167 120 075 

A ous 018 02h 029 259 188 223 153 088 

Dante T 016 O17 028 Ob5 250 219 180 139 © 106 

A 018 4012 129s 

T O16 O12 018 026 143 148 166 330 

A 006 206 223 123 123 

Cervantes T 006 O12 O31 218 227 165 119 

A 00 O18 12 159-123) 347 

T O10 O10 023 135 145 155 17h 334 

‘ans A 029 O12 O41 024 159 19% 1h? 170 223 

A 018 018 «46018 22359 

Michelangelo T O13: O1l 019 029 16h 165 166 172 261 

A 018 «6041 

T 028 030 067 «3517 092) 

Genahi A O41 065 Ob1 O76 270 165 100 123 118 

T 057 Oh 254 180 139 112 109 

Biben Berio T 032 030 Ob 062 284 208 15% 112 078 

A 006 4012) «60012 

eaten Curie T O10 O12 O21 036 219 210 1 1 138 

A 006 000 O47 «270 206 153 19% 

T 007 

A 059 059 053 070 282 188 100 106 082 

Sarmiento T 053 O73 273 179 «133 095 082 

A 024 O35 O41 065 274 206 100 112 

average Discrep- 006 010 010 O14 02h 028 021 


ancy (per column) 


Standard Deviation = .022 Average Discrepancy (total) = .016 


*A = Actual Values T = Theoretical Value. 
The stimuli were presented in four different ways: paired comparisons 
(15 stimuli), successive intervals (25 stimuli), multiple category method 
(15 stimuli) and graphic rating scale (25 stimuli). Eight arbitrary interval 
boundaries were superimposed on the continuum to score the results obtained 
by using the graphic rating scale method. The 15 names included in the 
paired comparisons were common to all the other methods. Instructions to 
the subjects stated clearly that the continuum varied from “extreme lack 
of interest” to “extreme interest” in knowing the persons indicated by the 
stimuli. 

The experimental population consisted of 170 adults, of both sexes, 
most of them enrolled in teacher training institutions in Montevideo, Uruguay. 
The tests were administered to groups of 20 to 30 subjects. To check the 
accuracy of formula (11) a fictitious example with known $\sigma_j$ values was 
prepared. The values obtained by using formula (11) and the real values of 
$\sigma_j$ are compared in Table 1. 

The following operations were performed:
a) Frequencies and corresponding proportions were obtained for all the
cells. (The actual values for these proportions are given in the upper
portion of the cells in Tables 2, 3, 4 and 5.)
b) Cumulative proportions were calculated and the corresponding normal
deviates determined.
c) The $L_i$ and $S_j$ values were computed using formulas (12) and (13),
Tables 6 and 7.
d) The values of $\sigma_j$ were determined by means of formula (11), Table 6,
as follows: i) from the $X_{ij}$ values the $V_j$ values were computed;
ii) the sum of all $(1/V_j)$ values was determined; iii) noting that in
formula (11) $\sum (1/V_j)$ and $n$ are constants, for a stimulus $j$ the
value of $\sigma_j$ was obtained by finding first the value of $V_j$ and then
applying formula (11).
e) Improved estimates of $L_i$ and $S_j$ were obtained by means of formulas
(5) and (6), Tables 6 and 7.
f) The original proportions were reproduced using formula (14). (These
values are given in the lower portions of the cells in Tables 2, 3, 4 and 5.)
The paired comparison data were analyzed using Thurstone’s cases III and V,
as described by J. P. Guilford (3).
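
The first two operations (frequencies to proportions, then cumulative
proportions to normal deviates) can be sketched as follows. This is only an
illustration: the function name and the category counts are hypothetical, and
the inverse normal is taken from Python's standard library rather than from
the tables the authors would have used.

```python
from statistics import NormalDist

def boundary_deviates(frequencies):
    """Normal deviates at the upper boundary of every category except the last.

    frequencies: counts of judgments per category for one stimulus
    (hypothetical input; the last boundary has cumulative proportion 1
    and is therefore skipped).
    """
    total = sum(frequencies)
    running = 0
    deviates = []
    for count in frequencies[:-1]:
        running += count
        # inv_cdf raises StatisticsError if the proportion is exactly 0 or 1,
        # which is one reason extreme cells need separate treatment.
        deviates.append(NormalDist().inv_cdf(running / total))
    return deviates

# Hypothetical counts for one stimulus over five categories.
example = boundary_deviates([5, 20, 50, 20, 5])
print([round(x, 3) for x in example])
```

With these symmetric counts the deviates come out symmetric about zero, which
is a quick sanity check on the cumulation step.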

Figure 1 indicates that the relationship between the $S_j$ values obtained
by means of the different procedures here described may be interpreted as
linear. The slope of the best-fitting line is an indication of the relative
dispersion of the $S_j$ values in the two procedures. The paired comparison
procedure gives the maximum scatter while the method of successive intervals
reduces this scatter to a minimum. This may be due to the actual manner of
presenting the stimuli. It should also be remembered that the best
determination of the $S_j$ values by using the paired comparisons method
implies an estimation of the correlations between stimuli. As will be shown
in a future paper, there are reasons to believe that some of the stimuli used
in this study have substantial correlations among themselves. Consequently,
this may explain some of the discrepancies found in this study. Our results are in
agreement with those reported by Hevner (4), Saffir (5) and Edwards (1).

The procedure employed to deal with extreme X values is as follows:
a) Cells with cumulative proportions above .990 or below .010 were deleted.
b) The X values for the remaining cells were calculated and the average for
each column was found (Table 8). c) In Table 8 the difference between the
averages for successive columns was found; for instance, the difference
between columns 3 and 4 was .260. d) This difference was used to determine
the estimated value corresponding to the deleted cells in column 3; for instance,
for the stimulus Pasteur the new value was -1.977 + (-.260) = -2.237,
and for Roosevelt it was -1.935. A similar procedure was followed to
complete all the missing cells in the table.
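
A minimal sketch of this fill-in rule is given below. It assumes that a
deleted cell is estimated from the nearest retained cell in the same row by
adding the difference between the two column averages; the function name and
the column averages are hypothetical, and only the difference of .260 and the
Pasteur value come from the text.

```python
def fill_deleted_cells(row, column_means):
    """Estimate deleted cells (None) in one stimulus row.

    row: X values for one stimulus, with None where a cell was deleted.
    column_means: average X value of each column over the retained cells.
    """
    filled = list(row)
    # Right-to-left sweep: fills deleted cells at the low end of the row.
    for c in range(len(filled) - 2, -1, -1):
        if filled[c] is None and filled[c + 1] is not None:
            filled[c] = filled[c + 1] + (column_means[c] - column_means[c + 1])
    # Left-to-right sweep: fills deleted cells at the high end of the row.
    for c in range(1, len(filled)):
        if filled[c] is None and filled[c - 1] is not None:
            filled[c] = filled[c - 1] + (column_means[c] - column_means[c - 1])
    return filled

# Columns 3 and 4 only; the two averages are hypothetical but differ by .260.
column_means = [-1.900, -1.640]
pasteur = fill_deleted_cells([None, -1.977], column_means)
print(round(pasteur[0], 3))
```

For the Pasteur row this reproduces the value -1.977 + (-.260) = -2.237
quoted above.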

The estimated values in Table 8 are given in parentheses (16 values
out of 225 for Table 8, 10 out of 225 for graphic rating scale, and 1 out of
75 for multiple category method). As a final check on the operations, remember
that $\sum S_j = 0$ and $\sum \sigma_j = n$.

The theoretical proportions obtained using formula (14) should agree 
closely with the actual proportions provided the assumptions used in the 
development of the method are substantially correct. At the bottom of 
Tables 2, 3, 4 and 5 the standard deviation and average discrepancies between 
the actual and theoretical proportions are given. It is clear that the $S_j$ and
$\sigma_j$ values obtained in the manner indicated in this article reproduce the
experimental values satisfactorily. Notice that the reproduction of the
experimental values using the method here described is better than the one
obtained using the paired comparisons method.

The diadic relationships existing in a group can be defined in terms of
the members’ choices, rejections, and their perceptions of being chosen and 
rejected. The number of possible distinct diads is 45. Formulas are given for 
computing the expected frequency and variance of the different diadic forms 
expected, when certain random factors are taken into account. These values 
must be known if the operation of factors other than the specified random ones 
is to be studied. Values obtained from two models with different assumptions 
are compared with empirical values. A simplified treatment is possible for 
groups with ten or more members. 


The student of interpersonal processes often needs to describe and 
classify in some useful form the relationships between individuals. One 
such classification is given by relational analysis (2), a method developed 
in conjunction with a series of studies in interpersonal perception. In this 
classification the relationship between two persons is described in terms of 
the feeling each has for the other, and the perception each has of the other’s 
feeling. More specifically, each member of a well-acquainted group is asked 
to select those he likes most and those he likes least, and also to guess who 
likes him most and least. This procedure yields a simple but useful description 
of the relationship existing between each of the N(N — 1)/2 pairs in the 
group. 

Since each subject $S_i$ can choose, reject, or omit any other subject $S_j$,
and can feel chosen, rejected, or omitted by him, nine arrangements are
possible of $S_i$’s feelings and perceptions regarding $S_j$. We shall define a
diad between $S_i$ and $S_j$ as any one of the nine possible arrangements of
selections of $S_i$, combined with any one of the nine possible arrangements of
selections of $S_j$, without regard to order. The number of possible distinct
diads is 45.
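
The count of 45 can be checked by direct enumeration. The sketch below
assumes only what the paragraph states: each member's statement is a pair
(feeling, perception) with three possible values each, and a diad is an
unordered pair of such statements. The variable names are illustrative.

```python
from itertools import combinations_with_replacement, product

# Each statement is (feeling, perception), coded 0 = like, 1 = omission, 2 = dislike.
statements = list(product(range(3), repeat=2))

# A diad pairs the two members' statements without regard to order.
diads = list(combinations_with_replacement(statements, 2))

symmetric = [d for d in diads if d[0] == d[1]]
print(len(statements), len(diads), len(symmetric))
```

Nine of the 45 diads are symmetric (both members make the same statement);
the remaining 36 arise from 9 x 8 / 2 unordered pairs of different statements.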

If $S_i$’s feeling toward $S_j$ be denoted by 0 for like, 1 for omission and
2 for dislike and, if $S_i$’s predictions of $S_j$’s feeling toward him be
denoted by 0 for like, 1 for omission and 2 for dislike, then we can represent
some of the 45 possible relationships as follows:


 $S_i$   $S_j$        $S_i$   $S_j$
 (11)    (11)         (00)    (00)
 (01)    (11)         (00)    (22)
 (00)    (01)         (22)    (22)


Legend: the first digit in each bracket corresponds to the feeling, the
second to the perception.

*The present problem emerged from research undertaken as part of a project in
interpersonal perception being carried out at the Laboratory of Social Relations at Harvard
with the financial aid of the Office of Naval Research (Task Order N5ori-07646).
†Now at Columbia University.
‡Now at Johns Hopkins University.


Some of the possible diads are well integrated, positive, and realistic; 
others involve contrary feelings and mistaken perceptions; still others indicate 
a well-developed negative, and recognized, mutual orientation. 

Psychologically important features of a group can be described in terms 
of the frequency of occurrence of the various diads. It is apparent, however, 
that given the number of choices, omissions, and rejections, and the number 
of perceptions of choice, omission, and rejection made by each member of a 
given group, each diad may be expected to occur a certain number of times by 
chance alone. To interpret observed data we must know something of these 
chance distributions, so that we will not attempt to give a psychological 
interpretation to data which can be explained by the operation of chance 
alone. When we know which specific diads occur from group to group with 
greater or less than chance frequency, then we can formulate hypotheses 
about the possible non-chance factors at work. For these reasons it is important 
to be able to state the expected frequency of occurrence and the variance 
of each diad type in a group of given size for an assumed chance model. 
In previous work (3) estimates of these quantities were obtained by construct- 
ing a Monte Carlo robot “group,’’ which was set to match the real group 
man by man in the number of choices, omissions, and rejections made and 
in their respective perceptions. This is clearly an unsatisfactory and inefficient 
method if it can be replaced by a simple mathematical formula. 

The purpose of this paper is to present a model in terms of which we 
can estimate the expected chance frequency and variance of the various 
diads. It should be borne in mind that the distribution of such frequencies is, 
probably, often more Poisson than normal. 


I. The Model 


Several possible “chance” models are conceivable, depending on what
we choose to regard as chance. The first one we shall examine corresponds
to the case in which the members of a group are regarded as automata,
randomly allocating their selections according to fixed probabilities of
choosing, rejecting, or omitting every member of the group. Three other
assumptions are made. First, statistical independence is assumed among
the different choices, and between the choices and guesses, made by any
individual. Second, the choices and guesses of any subject are assumed to
be independent of those made by any other subject. Finally, we assume each
subject may not choose or guess the same other subject more than once.

For this model, in other words, we assume that the chance occurrence
does not include the operation of any psychological factors except those
which govern the relative frequencies of the choices and perceptions. In
Section II we shall discuss a modification of this, in which we assume an
S’s perceptions to be conditioned by his choices and rejections.

Let us now proceed with the derivation of the expressions for the expected
frequency and variance of each of the diad types. Let $S_i$’s feeling toward
$S_j$ be denoted by 0 for like, 1 for omission, and 2 for dislike. Let $S_i$’s
prediction of $S_j$’s feeling toward him be denoted by 0 for like, 1 for
omission, and 2 for dislike. $S_i$’s statement of his relationship with $S_j$
will be written $(k_1k_2)_{ij}$. Then a diad may be denoted
$(k_1k_2)_{ij}(k_1'k_2')_{ji}$, where $k_1 = 0$, 1 or 2 etc., and since we do
not consider the order, the diads $(k_1k_2)_{ij}(k_1'k_2')_{ji}$ and
$(k_1'k_2')_{ij}(k_1k_2)_{ji}$ are identical. We will sometimes distinguish
between them for computational reasons, but in general we shall denote either
of these diads by $(k_1k_2)(k_1'k_2')$.

For this model we have assumed that each $S_i$ has a fixed probability
$P_i(k_1)$ of liking, not mentioning, or disliking each (other) $S_j$ and that
this $P_i(k_1)$ is independent of $j$; similarly $S_i$ has a fixed probability
$Q_i(k_2)$ of predicting these feelings on the part of each $S_j$, and this is
independent of $j$ and of $P_i$.

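Now let $X_{ij}$ be a random variable which assumes the value 1 if the diad
between $i$ and $j$ has some specified value $(k_1k_2)(k_1'k_2')$, and is 0
otherwise. Then

$$X = \tfrac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} X_{ij} \qquad (i \neq j)$$

is a random variable representing the frequency of occurrence of this specified
diad in the group. (The following formulas are readily generalized to situations
in which any fixed number of categories of questions are answered by the
S’s and for which the number of possible responses in each category need
only be required to be finite. However, many more than two categories
with three or four responses each are not very practical.) Since the $X_{ij}$
are all independent,

$$E(X) = \tfrac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} E(X_{ij}) \qquad (i \neq j)$$

and

$$E(X_{ij}) = P_i(k_1)\,Q_i(k_2)\,P_j(k_1')\,Q_j(k_2'),$$

so

$$E(X) = \tfrac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} P_i(k_1)\,Q_i(k_2)\,P_j(k_1')\,Q_j(k_2') \qquad (i \neq j).$$

And similarly

$$\operatorname{var}(X) = \tfrac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} \operatorname{var}(X_{ij}) \qquad (i \neq j)$$

and

$$\operatorname{var}(X_{ij}) = E(X_{ij})\,[1 - E(X_{ij})],$$

so

$$\operatorname{var}(X) = E(X) - \tfrac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} [P_i(k_1)\,Q_i(k_2)\,P_j(k_1')\,Q_j(k_2')]^2 \qquad (i \neq j).$$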
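
As an illustration of these expressions, the sketch below computes the
expected frequency and variance for a symmetric diad such as (00)(00). It
assumes that each of the n(n - 1)/2 pairs contributes one independent 0/1
indicator whose probability is the product of the two members' feeling and
perception probabilities. The function name and the probabilities are
hypothetical and are not the values of the ten-man group.

```python
from itertools import combinations

def symmetric_diad_moments(P, Q):
    """Expected frequency and variance of a symmetric diad (k1 k2)(k1 k2).

    P[i]: probability that member i expresses feeling k1 toward another member.
    Q[i]: probability that member i predicts feeling k2 from another member.
    """
    expected = 0.0
    variance = 0.0
    for i, j in combinations(range(len(P)), 2):   # each pair counted once
        p_ij = P[i] * Q[i] * P[j] * Q[j]          # E(X_ij)
        expected += p_ij
        variance += p_ij * (1.0 - p_ij)           # var(X_ij) = E(X_ij)[1 - E(X_ij)]
    return expected, variance

# Hypothetical ten-man group in which every member likes and feels liked by
# three of the nine others, so P_i(0) = Q_i(0) = 3/9.
n = 10
expected, variance = symmetric_diad_moments([3 / 9] * n, [3 / 9] * n)
print(round(expected, 3), round(variance, 3))
```

Because each pair probability is small, the variance is only slightly below
the expected value, which is in line with the remark that such frequencies are
often closer to Poisson than to normal.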
Table 1 shows the observed number of choices, omissions, and rejections,
and the number of perceptions of choice, omission, and rejection given by
each of the members of a ten-man group. The data used as an example were
obtained at the end of the last meeting of a series of twelve sessions
conducted by a psychoanalyst. The nature of the meetings was a modified form
of group therapy where the members met to “discuss principles of group
psychology particularly as these relate to self understanding.” The procedure
consisted of asking the members to indicate those others in the group they
“liked most” and “least” as well as to guess who would name them as liking
them “most” and “least.”


TABLE 1
Observed Frequencies of Different Feelings and Perceptions in a Ten-Man Group
(Columns: Subject $i$; Feeling $k_1 = 0, 1, 2$; Perception $k_2 = 0, 1, 2$.
The individual entries are not legible in this copy.)


TABLE 2
Observed and Expected Frequencies of Congruous and Non-Congruous Diads;
Model with Perception Not Contingent upon Feeling

Diad Type                Observed    Expected
Bilateral Congruency        20          6.87
Unilateral Congruency       22         21.37
No Congruency                3         16.76


TABLE 3
Conditional Probabilities $Q(k_2 \mid k_1)$ for the Ten-Man Group

$k_1$    $Q(0 \mid k_1)$    $Q(1 \mid k_1)$    $Q(2 \mid k_1)$
0            0.63               0.35               0.02
1            0.24               0.67               0.10
2            0.07               0.40               0.53


TABLE 4
Observed and Expected Frequencies of Congruous and Non-Congruous Diads;
Model with Perception Contingent upon Feeling

Diad Type                Observed    Expected
Bilateral Congruency        20         17.63
Unilateral Congruency       22         21.56
No Congruency                3          6.07

Chi square = 1.86    d.f. = 2    p ≈ 0.40

Analysis of the data in terms of the particular composition of each of 
the N(N — 1)/2 diads gives the observed frequency of each diad type and, 
in terms of these, describes the group. For example, the diad (00) (00) has an 
observed frequency of six, while (02)(20) does not occur; the figures below 
show that these frequencies are different from the expected value predicted 
by the chance model. 


Diad Observed Expected Variance 
(00) (00) 6 0.48 0.46 
(02) (20) 0 0.50 0.50 


The first diad, in which both subjects like each other and predict being 
liked by the other, occurs more often than expected by chance; the other, 
in which feelings are not mutual but are accurately predicted, occurs less 
often than expected, but not significantly so. 

In general we have found that there is a significant discrepancy between 
observed frequencies and those predicted by this chance model. This indi- 
cates that there are factors operating other than those we have assumed in 
this model, and the differences do suggest the nature of some of these factors. 

Let us further exemplify the use of the model. It is quite apparent that,
in general, the feeling we hold for a person is congruent with the feeling we
perceive that person holds for us. This tendency is quite apparent in all of
our data. Thus member $S_i$ tends to choose and feel chosen by $S_j$, or to
dislike and feel disliked by $S_j$, etc. Is this tendency sufficiently
consistent that diads containing such congruencies between feelings and
perceptions would exceed chance, while others would fall below chance? The
present model permitted us to test such hypotheses by supplying us with an
acceptable chance baseline. The data for the ten-man group mentioned above
will be used to illustrate this point. We will separate the diads into three
groups. In the first we will put all diads in which feeling and perception are
the same for both members: (00)(00), (00)(11), (00)(22), etc. In the second,
we will put those diads in which this is true for only one member: (00)(01),
(00)(12), etc.; in the third we will put all diads in which this holds for
neither member: (01)(12), (10)(21), etc. If our conjecture is right, the first
class should contain more cases than expected, and the third class fewer than
expected, on the basis of chance alone. The figures presented in Table 2 show
that the differences are as predicted; the probability of this occurring with
the chance model is less than 0.001.


II. Model with Perceptions Dependent upon the Subject’s Feelings


We may now modify the basic chance model by incorporating various 
hypotheses about the group, to see whether these additional factors will 
explain the observed results. 

We have said above that, in all groups, we have observed a strong ten- 
dency for a member to predict that others feel toward him whatever he feels 
toward them, and we have exhibited this tendency in one group. It seems 
reasonable to ask whether this tendency alone accounts for the deviation 
from chance. We shall, therefore, investigate how well a model with this 
modification accounts for the data. 

We shall assume again that each S responds in an independent manner,
with probability $P_i(k)$ of liking, omitting, or disliking any other subject.
However, we shall now assume that this choice conditions his prediction of
another’s feeling toward him, so that his probability of predicting a given
response on the part of another member is not $Q_i(k')$, as before, but is
$Q_i(k' \mid k)$. The expressions for the expected value for the occurrence of
any diad and the variance take similar forms to those presented above, with
this conditional probability used for $Q_i$. In the case of the group used
here as an example, we do not have sufficient data to estimate the conditional
probabilities individually for each member, so we shall use one set of such
probabilities $Q(k' \mid k)$ for all the members, estimating the values from
the data for all members combined. This simplification is not necessary in
general but will be used for the example. With this assumption, the expected
frequency of occurrence of a given diad reduces to

$$E(X) = \tfrac{1}{2}\,[Q(k_2 \mid k_1)\,Q(k_2' \mid k_1')] \sum_{i=1}^{n} \sum_{j=1}^{n} P_i(k_1)\,P_j(k_1') \qquad (i \neq j)$$

and the expression for the variance is similarly simplified.

For our group, the conditional probabilities observed are shown in 
Table 3. If we now combine the diads as we did in Table 2, and compare the 
frequencies observed and the frequencies predicted using the conditional 
probabilities, we can observe a striking improvement in agreement. (Cf. 
Table 4 and compare with Table 2). 

Using the chi-square test, we see that there is a probability of about 
0.40 that the value of chi-square observed would be exceeded if the hypothesis 
were true. We can on this evidence neither accept nor reject the hypothesis 
that the observed frequencies of these diad types are accounted for by a 
chance model with predictions conditioned by feelings; but the improvement 
in fit is striking and suggests that a large part of the observed distribution of 
diad types is due to such contingency. This example illustrates the use of 
such baseline models in the study of the meaning of the observed frequency 
distribution for the diad types. 
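
The two comparisons can be recomputed from the observed and expected
frequencies in Tables 2 and 4. The sketch below uses the ordinary Pearson
chi-square statistic and the fact that, for 2 degrees of freedom, the upper
tail probability of chi-square is exactly exp(-x/2); the function names are
illustrative only.

```python
import math

def chi_square(observed, expected):
    """Pearson chi-square statistic for matched observed/expected counts."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

def upper_tail_df2(chi2):
    """P(chi-square with 2 d.f. exceeds chi2); exact closed form for 2 d.f."""
    return math.exp(-chi2 / 2.0)

observed = [20, 22, 3]                       # bilateral, unilateral, none
expected_independent = [6.87, 21.37, 16.76]  # Table 2
expected_conditional = [17.63, 21.56, 6.07]  # Table 4

chi2_independent = chi_square(observed, expected_independent)
chi2_conditional = chi_square(observed, expected_conditional)
print(round(chi2_independent, 2), upper_tail_df2(chi2_independent))
print(round(chi2_conditional, 2), round(upper_tail_df2(chi2_conditional), 2))
```

For the conditional model this gives a chi-square of about 1.88 and a tail
probability of about 0.39, close to the reported 1.86 and 0.40 (the small
difference is presumably rounding in the tabled expected values). For the
independent model the statistic exceeds 36, far beyond the 0.001 level.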

It is obvious that other hypotheses about the group could be tested by
constructing a similar model and examining the observed frequencies to
determine how much of the variation is accounted for by such a model. The
principle in all cases is the same: a model is constructed which assumes that
the members of the group are automata acting at random, with probabilities
governed by the particular hypotheses at hand. The expected frequencies
obtained from this model are then used to investigate the group and to
determine whether we have reason to believe that other psychological processes
are at work beyond those assumed in the model. These hypotheses must be
chosen with care, however, in order to yield a model which is mathematically
tractable and which leads to a practical amount of computational labor.


III. Simplifications for Large Groups

In the models developed above, we have allowed the probabilities $P_i$
and $Q_i$ to be different for each member of the group. This leads to lengthy
calculations for large groups. For groups larger than 10, however, we may
introduce a simplification which greatly reduces this labor by using the mean
value over all members of the group for the value of $P_i$; each member is
then described by the same probabilities, and summations are no longer
necessary. In the case of the first model mentioned above, if the mean value
of $P_i(k_1)$ is denoted by $P(k_1)$, and the mean of $Q_i(k_2)$ by $Q(k_2)$,
we may then write the expected value $E(X)$ as

$$E'(X) = \tfrac{1}{2}\,[n(n - 1)\,P(k_1)\,Q(k_2)\,P(k_1')\,Q(k_2')],$$

and the expression for the variance is similarly simplified.
Let us examine the error involved in this approximation. Let

$$A(X) = \sum_{i=1}^{n} P_i(k_1)\,Q_i(k_2) - n\,P(k_1)\,Q(k_2)$$

and

$$A'(X) = \sum_{i=1}^{n} P_i(k_1')\,Q_i(k_2') - n\,P(k_1')\,Q(k_2')$$

and

$$B(X) = \sum_{i=1}^{n} P_i(k_1)\,Q_i(k_2)\,P_i(k_1')\,Q_i(k_2') - n\,P(k_1)\,Q(k_2)\,P(k_1')\,Q(k_2').$$

Then it can easily be shown that if $E(X)$ is the expected value previously
calculated using the individual probabilities, and $E'(X)$ is the expected
value given above,

$$E(X) - E'(X) = \tfrac{1}{2}\,[A(X)\,A'(X) + n\,P(k_1')\,Q(k_2')\,A(X) + n\,P(k_1)\,Q(k_2)\,A'(X) - B(X)];$$

and so if we use $D(X) = [E(X) - E'(X)]/E'(X)$ as a measure of the error,

$$D(X) = \frac{A(X)\,A'(X)}{n(n - 1)\,P(k_1)\,Q(k_2)\,P(k_1')\,Q(k_2')} + \frac{A(X)}{(n - 1)\,P(k_1)\,Q(k_2)} + \frac{A'(X)}{(n - 1)\,P(k_1')\,Q(k_2')} - \frac{B(X)}{n(n - 1)\,P(k_1)\,Q(k_2)\,P(k_1')\,Q(k_2')}.$$

Now it has been found from experience that $A(X)/[P(k_1)Q(k_2)]$ and
$A'(X)/[P(k_1')Q(k_2')]$ are less than 2, and in almost all cases very near 1
for the groups encountered in practice.
$B(X)/[(n - 1)P(k_1)Q(k_2)P(k_1')Q(k_2')]$ is less than 1, and in almost all
cases less than 1/2; so in practice this error $D(X)$ is less than $5/n$ for
$n$ greater than 5. In almost all cases this turns out to be a very
liberal estimate of the error; for example, in the group of 10 used earlier,
typical errors are


D[(00)(00)] = 0.00738 = 0.74%
D[(00)(01)] = 2.28%
D[(00)(02)] = 0.74%


For the model with $Q_i(k_2)$ given by $Q(k_2 \mid k_1)$ for all members, the
error in replacing the $P_i$ by $P$ is even less. In this case, $A(X)$ and
$A'(X)$ are 0, and the error is then less than $1/n$.

This simplification is particularly useful because it introduces the least 
error for large groups, where it is most needed to simplify the calculation. 
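
The size of the approximation error can be examined numerically. The sketch
below assumes the forms used in this section: the exact expected value is
half the sum, over ordered pairs i ≠ j, of $P_i(k_1)Q_i(k_2)P_j(k_1')Q_j(k_2')$;
the approximation replaces every probability by its group mean; and A, A', B
are the deviations of the corresponding sums from their mean-value
counterparts. All probabilities are randomly generated and hypothetical.

```python
import random

def expected_exact_and_approx(P1, Q2, P1p, Q2p):
    """Return (exact E, mean-value E', right-hand side of the error identity)."""
    n = len(P1)
    a = [P1[i] * Q2[i] for i in range(n)]      # P_i(k1) Q_i(k2)
    b = [P1p[i] * Q2p[i] for i in range(n)]    # P_i(k1') Q_i(k2')
    exact = 0.5 * sum(a[i] * b[j] for i in range(n) for j in range(n) if i != j)
    pq = (sum(P1) / n) * (sum(Q2) / n)         # P(k1) Q(k2) from group means
    pq_p = (sum(P1p) / n) * (sum(Q2p) / n)     # P(k1') Q(k2') from group means
    approx = 0.5 * n * (n - 1) * pq * pq_p
    A = sum(a) - n * pq
    A_p = sum(b) - n * pq_p
    B = sum(a[i] * b[i] for i in range(n)) - n * pq * pq_p
    rhs = 0.5 * (A * A_p + n * pq_p * A + n * pq * A_p - B)
    return exact, approx, rhs

rng = random.Random(0)                         # fixed seed for repeatability
n = 12
P1 = [rng.uniform(0.2, 0.5) for _ in range(n)]
Q2 = [rng.uniform(0.2, 0.5) for _ in range(n)]
P1p = [rng.uniform(0.2, 0.5) for _ in range(n)]
Q2p = [rng.uniform(0.2, 0.5) for _ in range(n)]

exact, approx, rhs = expected_exact_and_approx(P1, Q2, P1p, Q2p)
relative_error = (exact - approx) / approx     # D(X)
print(exact, approx, relative_error)
```

The difference between the exact and approximate values agrees with the
bracketed expression in A, A', and B to rounding error, whatever
probabilities are supplied, since the identity is purely algebraic.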


IV. Summary 


Relational analysis defines the diadic relationship existing between pairs 
of members of a group in terms of their choices, their rejections, and their 
perceptions of being chosen and rejected. The number of possible diads is 45. 
In order to interpret the results of an experiment, we must have knowledge 
of the expected occurrence of the various diads on a chance basis, when only 
certain specified processes govern the chance distribution of diads. 

This paper discusses the construction of models which give the expected 
value and variance of the diads, when certain assumptions are made as to 
the random factors operating in the group. In general, the assumptions are 
that only very simple psychological factors are operating in the group, and 
that the occurrence of the various diad forms is the result of chance operating 
within the restriction of these factors. The observed data are then examined 
to determine whether the chance model accounts for the distribution of diad 
types, or whether additional psychological processes must be postulated. 
Models such as these are essential for testing various hypotheses about 
interaction in the group since they provide a method for setting up and testing 
a null hypothesis by the usual statistical methods. 

The models discussed in the paper were constructed on the assumption 
that choices and predictions were independent from member to member, and 
under the assumption that prediction was conditioned by choice in any pair 
as well as the assumption that choice and prediction were independent. The 
first of these assumptions was shown to account for a large part of the observed 
variation in diad frequency. Simplified assumptions which are valid for large 
groups were also discussed.

Models such as the second one discussed in this paper are typical of a
large variety of models which could be constructed to test various hypotheses
about the sources of the variation of frequency of the diad types.

Ashby, 12, 135-140

Equivalence and reliability. Johnson, 15, 
115-119 

Equivalence coefficient. Cronbach, 12, 1-16; 
Cronbach and Warrington, 16, 167- 
188; Cronbach, 16, 297-334 

Ergographic experiment, rational function. 
Tucker, 20, 51-61 

Errata. Degan, 13, 87-89; Cureton, 18, 
la; Guttman, 20, 124 

Estimation, —— likelihood. Lord, 


truncated multivariate normal. Votaw, 


Rafferty, and Deemer, 15, 339-347 
Eta. Peters, 11, 57-69; Lord, 17, 181-194 
multiserial. Wherry and Taylor, 11, 155-161
Eureka-type problems. Lorge and Solomon, 
20, 139-148 
Exhaustion method. Gengerelli, 13, 135- 
146; Layton, 16, 51-56 
F-tests, bias. Gourlay, 20, 227-248, 273-287 
error terms. Binder, 20, 29-50 
Factor analysis, aesthetic preferences. Peel, 
11, 129-137 
age levels. Burt, 12, 171-188; Jones, 14, 
299-331 
Air Force Classification Battery. Guil- 
ford, Fruchter, and Zimmerman, 17, 
45-68; Zachert and Friedman, 18, 
219-224 
analytic rotation. (see Factor analysis, 
rotation) 
anthropometric data. Thurstone, 11,
15-21; Burt, 12, 171-188; Heath, 17,
87-100
aptitude tests. (See Factor analysis, 
intelligence) 
body types. Thurstone, 11, 15-21; Burt, 
12, 171-188; Heath, 17, 87-100 
canonical. Rao, 20, 93-111 
centroid. Holley, 12, 263-265 
changes due to learning. Fleishman and 
Hempel, 19, 239-252 
closure factors. Botzum, 16, 361-386; 
Pemberton, 17, 267-288 
common elements. Gengerelli, 13, 69-77 
communalities. Medland, 12, 101-109; 
Rosner, 13, 181-184; Wherry, 14, 
231-241 
computational methods. Wherry and
Winer, 18, 161-179
creative thinking. Wilson, Guilford, 
Christensen, and Lewis, 19, 297-311 
critique. McNemar, 16, 353-359 
descriptive vs. inferential. Hartley, 19, 
195-203 
econometric models. Reiersøl, 15, 121-149


extension methods. Wherry, 14, 231-241 

factor scores. Guilford and Michael, 13, 
1-22; Baehr, 17, 107-126 

factorial invariance. Zimmerman, 18, 77-93

figure classification tests. Myers, 17, 347- 
352 

fluency in writing. Taylor, 12, 239-262 

group structure. Winer, 20, 63-68 

hypothesis testing. Burt, 12, 171-188;
Saunders, 13, 251-257; Rippe, 18,
191-205; Guttman, 19, 149-161; Hartley,
19, 195-203; Rao, 20, 93-111

identifiability. Reiersøl, 15, 121-149

image theory. Guttman, 18, 277-296 

individual tests. Rimoldi, 13, 27-46 

information theory. Ferguson, 19, 281- 
290 

intelligence. Rimoldi, 13, 27-46; Yela, 
14, 121-135; Jones, 14, 299-331; 
Rimoldi, 16, 75-101; Guilford, Fruch- 
ter, and Zimmerman, 17, 45-68; 
Pemberton, 17, 267-288; Zimmerman, 
18, 77-93; Green, Guilford, Christen- 
sen, and Comrey, 18, 135-160 (See 
also Factor analysis, reasoning) 

intelligence and personality tests. Denton 
and Taylor, 20, 75-81 

interest inventories. Cottle, 15, 25-47 

interpretation. Davis, 11, 249-255 

intra-individual. Cattell, Cattell, and 
Rhymer, 12, 267-288 

intra-task performance. Fleishman, 18, 

invariance under selection. Ahmavaara, 
19, 27-38 

item selection. Bedell, 15, 419-430 

iterative methods. Wherry, Campbell, 
and Perloff, 16, 67-74; Wherry and 
Winer, 18, 161-179 

job families. Coombs and Satter, 14, 
33-42 


latent class analysis. Green, 16, 151-166 

matching factors. Burt, 12, 171-188; 
Cattell, 12, 197-220; Jones, 14, 299- 
331; Cattell, Dubin, and Saunders, 19, 
209-230 

basis. Guttman, 17, 209-222

maximum likelihood estimation. Rao, 20, 
93-111 

mechanical ability. Chapman, 13, 175-179 

motor tests (see Factor analysis, psy- 
chomotor tests). 

multiple-group method. Thurstone, 14, 
43-45; Guttman, 17, 209-222; Adcock, 
17, 249-253; Harman, 19, 39-55 

navigation variables. Carter and Dudek, 
12, 31-42 

neurotic tendency. MacCrone and Star- 
field, 14, 1-20 

nonverbal tests. Myers, 17, 347-352 

number of factors. Guttman, 19, 149-161 

oblique coordinates. Gulliksen and 
Tucker, 16, 233-238 


occupations. Coombs and Satter, 14, 33-42

olfaction. Hsü, 11, 31-42

orthogonal rotation. Zimmerman, 11,
51-55

orthogonal vs. oblique structure. Gibson,
17, 317-323; Zimmerman, 18, 77-93;
Harman, 19, 39-55

P-technique. Cattell, Cattell, and
Rhymer, 12, 267-288

ey of sums of squares. Harris, 20,


perception. Botzum, 16, 361-386; Guil- 
ford, Fruchter, and Zimmerman, 17, 
45-68; Green, Guilford, Christensen, 
and Comrey, 18, 135-160 

performance tests. Rimoldi, 13, 27-46; 
Fleishman, 18, 45-55 

personality. Cattell, 12, 197-220; Cattell, 
Cattell, and Rhymer, 12, 267-288; 
MacCrone and Starfield, 14, 1-20; 
Cottle, 15, 25-47; Thurstone, 16, 11- 
20; Baehr, 17, 107-126; Cattell, Dubin, 
and Saunders, 19, 209-230 

personality and intelligence tests. Denton 
and Taylor, 20, 75-81 

physiological variables. Cattell, Cattell, 
and Rhymer, 12, 267-288 

primary mental abilities battery. Zimmer- 
man, 18, 77-93 

principal axes. Green, 15, 57-61; Harris, 
20, 289-297 

principal components. Guttman, 20, 173- 
192 


profile similarity. Harris, 20, 289-297 

profiles. Sakoda, 19, 253-256 

psychomotor tests. Fleishman, 18, 45- 
55; Fleishman and Hempel, 19, 239-252 

Q-technique. Peel, 11, 129-137; Cattell,
12, 197-220; Sandler, 17, 223-229;
Sakoda, 19, 253-256

rank. Guttman, 19, 149-161 

reading. Thurstone, 11, 185-188; Davis, 
11, 249-255 

reasoning. Botzum, 16, 361-386; Pemberton,
17, 267-288; Green, Guilford,
Christensen, and Comrey, 18, 135-160;
Matin and Adkins, 19, 71-78; Wilson,
Guilford, Christensen, and Lewis, 19,
297-311

reflection in centroid analysis. Holley,
12, 263-265

regression analysis. Carter and Dudek,
12, 31-42

regression estimates. Peel, 11, 129-137

relation between test length and loadings.
Guilford and Michael, 15, 237-249

reliability. Guilford and Michael, 15,
237-249

residuals. Wherry, 14, 231-241

restriction in range. Ahmavaara, 19, 27-38

rights and wrongs scores. Fruchter, 18,
257-265

rotation. Zimmerman, 11, 51-55; Thurstone,
11, 71-79; Burt, 12, 171-188;
Saunders, 13, 251-257; Yela, 14, 121-135;
Sandler, 17, 223-229; Adcock, 17,
249-253; Gibson, 17, 317-323; Carroll,
18, 23-38; Zimmerman, 18, 77-93;
Thurstone, 19, 173-182; Hartley, 19,
195-203; Ferguson, 19, 281-290;
Tucker, 20, 209-225

sampling errors. Rippe, 18, 191-205 

scholastic achievement. Yela, 14, 121-135 

second-order. Rimoldi, 13, 27-46; Yela, 
14, 121-135; Rimoldi, 16, 75-101; 
Botzum, 16, 361-386; Baehr, 17, 107- 
126; Matin and Adkins, 19, 71-78 

selection of population. Degan, 13, 87- 
89; Birnbaum, Paulson, and Andrews, 
15, 191-204; Ahmavaara, 19, 27-38 

separation of data. Harris, 20, 23-28 

Sheppard Field Battery. Guilford, 
Fruchter, and Zimmerman, 17, 45-68 

simple structure (see Factor analysis, 
rotation). 

simplex. Guttman, 20, 173-192 

speeded tests. Myers, 17, 347-352;
Fruchter, 18, 257-265

stability. MacCrone and Starfield, 14,
1-20; Fleishman, 18, 45-55; Zachert
and Friedman, 18, 219-224

Stanford-Binet. Jones, 14, 299-331

suppressor variables. Wherry, 11, 239-247

temperament. Thurstone, 16, 11-20;
Baehr, 17, 107-126

test construction. Wherry and Winer,
18, 161-179; Loevinger, Gleser, and
DuBois, 18, 309-317

test-equivalents. Sandler, 17, 223-229

test items. Jones, 14, 299-331; Wherry
and Winer, 18, 161-179

test scores. Guttman, 17, 209-222

tests of significance (see Factor analysis,
hypothesis testing).

theory. Peel, 11, 129-137; Guttman, 17,
209-222; Ahmavaara, 19, 27-38; Harman,
19, 39-55; Hartley, 19, 195-203;
Harris, 20, 23-28; Rao, 20, 93-111

transformations. Gibson, 17, 317-323;
Harman, 19, 39-55

USAFI tests. Findley and Andregg, 14,
47-60

Fatigue, rational function. Tucker, 20,
Fiducial limits, point biserial correlation.
Perry and Michael, 19, 313-325
Fixed vs. random variables, analysis of
variance. Binder, 20, 29-50
Forced choice, number of alternatives. 
Fowler, 12, 221-232 
rationale. Coombs, 13, 59-68; Brogden, 
19, 141-148 
reliability. Fowler, 12, 221-232 
Game theory, classification. Votaw, 17, 
255-266; Dwyer, 19, 11-26 
Goodness of fit, paired comparisons. Mos- 
teller, 16, 207-218 
tests. Lindquist, 12, 65-78
Gramian matrices, minimum rank. Guttman,
19, 149-161
Greenwood-Yule model, negative binomial. 
Sichel, 16, 107-127 
Group problem-solving. Lorge and Solomon, 
20, 139-148 
Group structure, correlation. Winer, 20, 
63-68 
matrix methods. Luce and Perry, 14, 
95-116 
measurement. Luce, 15, 169-191 
status index. Katz, 18, 39-43 
Guttman scale analysis. Edwards 
Kilpatrick, 13, 99-114; Hays 
Borgatta, 19, 271-279 
History, mathematical model. Richardson, 
13, 147-174, 197-232 
Homogeneity, item. Cronbach, 16, 297-334 
Homogeneity of residuals, tests. Gulliksen 
and Wilks, 15, 91-114 
Homogeneity of variance, standard error 
measurement. Mollenkopf, 14, 189-229 
Homogeneous tests, construction. Loe- 
vinger, Gleser, and DuBois, 18, 309- 
317 
Hotelling’s T². Tyler, 17, 289-296 
Hypothesis testing, analysis of variance. 
Binder, 20, 29-50; Gourlay, 20, 227-248 
classification. Anderson, 16, 31-50 
combining tests of significance. Gordon, 
Loveland, and Cureton, 17, 311-316 
correlated proportions. McNemar, 12, 
153-157; Edwards, 13, 185-187 
correlations. Coombs, 13, 233-243 
curvilinearity of regression. Peters, 11, 
57-69 

error of measurement. Green, 15, 251-257 

factor analysis (see Factor analysis, 
hypothesis testing). 

information transmission. McGill, 19, 
97-116 

Johnson-Neyman technique. Abelson, 
18, 213-218 

latent structure analysis. McCarthy, 16, 
247-269; Anderson, 19, 1-10 

Latin squares. Gourlay, 20, 273-287 

learning. Boguslavsky, 20, 125-138 

linear. Rulon, 14, 259-278; Johnson and 
Fay, 15, 349-367 

multivariate. Gulliksen and Wilks, 15, 
91-114; Tyler, 17, 289-296 

negative binomial. Sichel, 16, 107-127 

non-parametric (see Non-parametric 
methods). 

paired comparison scaling. Mosteller, 16, 
207-218 

percentages. Swineford, 11, 43-49; Swine- 
ford, 13, 23-25; Swineford, 14, 183- 
187; Appel, 17, 325-330 

point biserial correlation. Perry and 
Michael, 19, 313-325 

ranks. Moses, 17, 239-247; Lyerly, 17, 
421-428 

scale analysis. McCarthy, 16, 247-269 

selection procedures. Sichel, 17, 1-39 


skewed distributions. Taylor, 12, 111- 
125; Sichel, 16, 107-127 

sociometric measures. Katz and Powell, 
18, 249-256 

trend. Lindquist, 12, 65-78; Alexander, 
12, 79-99 

true scores. Lord, 18, 57-76 

truncated distributions. Maritz, 18, 97- 
110 
IBM methods, intraserial correlation. Payne 
and Staugas, 20, 87-92 
item biserial correlations. Siegel and 
Cureton, 17, 41-43 
square roots. Reynolds, 11, 223-238 
sums of products. Ellis and Riopelle, 13, 
79-85; Castore and Dye, 14, 243-250; 
Burke, 17, 231-233; Ayers and Stanley, 
17, 305-310 
Idempotent matrices, factor analysis. 
Harris, 20, 23-28 
Identifiability. Reiersøl, 15, 121-149 
Image theory. Guttman, 18, 277-296 
Index, curvilinearity. Peters, 11, 57-69 
Information, factor analysis. Ferguson, 19, 
281-290 
multivariate transmission. McGill, 19, 
97-116 
test. Woodbury, 16, 103-106; Cronbach, 
19, 263-270 
Interaction, computational techniques. 
Edwards and Horst, 15, 17-24 
correlation. Coombs, 13, 233-243 
trend analysis. Lindquist, 12, 65-78 
Internal consistency, speeded tests. Gullik- 
sen, 15, 259-269; Cronbach and 
Warrington, 16, 167-188; Angoff, 18, 
1-14 
Intervening variables, mathematical model. 
Boguslavsky, 20, 125-138 
Intraclass correlation. Alexander, 12, 79- 
99; Coombs, 13, 233-243; Ebel, 16, 
407-424 
Intra-individual correlation, factor analysis. 
Cattell, Cattell, and Rhymer, 12, 267- 


Intraserial correlation, computation. Payne 
and Staugas, 20, 87-92 

Inverse, triangular matrix. Fruchter, 14, 
89-9; 


Item analysis, biserial correlation. Siegel 
and Cureton, 17, 41-43 

criterion-keying. French, 17, 101-106 

difficulty. Cronbach and Warrington, 17, 
127-147; Lord, 17, 181-194 

electronic computer. Ziegler, 19, 261-262 

extreme groups. Fan, 19, 231-237 

homogeneous tests. Loevinger, Gleser, 
and DuBois, 18, 309-317 

maximum validity. Gleser and DuBois, 
16, 129-139; French, 17, 101-106; 
Webster, 18, 207-211 

non-parametric methods. Webster, 18, 
207-211 

placement of items. Mollenkopf, 15, 291- 
315 
Item analysis (cont.) 
scaling methods. Edwards and Kilpatrick, 
13, 99-114 
selection of unifactor items. Bedell, 15, 
419-430 
speeded tests. Mollenkopf, 15, 291-315 
unattempted items. Gulliksen, 15, 259- 
269 
validity coefficient. Johnson, 12, 51-58 
Item characteristic function. Lord, 18, 57-76 
Item difficulty, optimal distribution. Cron- 
bach and Warrington, 17, 127-147 
Iterative solution, regression weights. Horst 
and Smith, 15, 271-289 
J-scales. Coombs, 19, 183-194 
Job analysis, factor analysis. Coombs and 
Satter, 14, 33-42 
Job distances. Thorndike, 18, 267-276 
Johnson-Neyman technique. Johnson and 
Fay, 15, 349-367; Abelson, 18, 213-218 
Joint-occurrence matrices, latent class 
analysis. Green, 16, 151-166 
KG index. Johnson, 12, 51-58 
Keying items, multiple-choice. French, 17, 
101-106 
Kurtosis, binomial. Gengerelli, 13, 69-77 
error of measurement. Mollenkopf, 14, 
189-229 
Latent roots, Gramian matrices. Guttman, 
19, 149-161 
Latent structure analysis. Green, 16, 151- 
166; McCarthy, 16, 247-269; Anderson, 
19, 1-10; Hays and Borgatta, 19, 271- 
279; Gibson, 20, 69-73 
Latent variables. Reiersøl, 15, 121-149 
Latin square, analysis of variance. Gourlay, 
20, 273-287 
Learning, models. Miller, 17, 149-167; 
Miller and McGill, 17, 369-396; Kao, 
18, 241-243; Goodman, 18, 245-248; 
Gulliksen, 18, 297-307; Boguslavsky, 
20, 125-138; Restle, 20, 201-208 
psychomotor. Fleishman and Hempel, 
19, 239-252 
Learning curves. Gulliksen, 18, 297-307 
Least squares, estimation. Brogden, 20, 
249-252 
image theory. Guttman, 18, 277-296 
scaling. Mosteller, 16, 3-9 
Length of test, effect on factor loadings. 
Guilford and Michael, 15, 237-249 
effective. Angoff, 18, 1-14 
optimal. Horst, 13, 125-134; Horst, 14, 
79-88; Horst, 16, 189-202 
Likelihood ratio criterion. Gulliksen and 
Wilks, 15, 91-114; Green, 15, 251-257 
Linear hypothesis. Rulon, 14, 259-278; 
Binder, 20, 29-50 
Linear programming, classification. Votaw, 
17, 255-266; Dwyer, 19, 11-26; Brog- 
den, 20, 249-252 
Linearity, tests. Lindquist, 12, 65-78; 
Alexander, 12, 79-99 
Markov chains. Miller, 17, 149-167; Kao, 
18, 241-243; Goodman, 18, 245-248 
Matching profiles, purposes. Cattell, 14, 
279-298 


Matrix, rearrangement. Gibson, 18, 111-113 
sociometric analysis. Luce, 15, 169-191 
theorems. Reiersøl, 15, 121-149; Gutt- 
man, 19, 149-161 
Maximum likelihood estimation, equating 
tests. Lord, 20, 193-200 
negative binomial. Sichel, 16, 107-127 
reliability. Alexander, 12, 79-99 
true scores. Lyerly, 16, 21-30; Lord, 18, 
57-76 
Measurement, theory. Weitzenhoffer, 16, 
387-406 


Mechanical model, oblique coordinates. 
Gulliksen and Tucker, 16, 233-238 
Methodology. Flanagan, 17, 359-368 
Minimax principle, classification. Anderson, 
16, 31-50 
Models, analysis of variance. Rulon, 14, 
259-278; Binder, 20, 29-50; Gourlay, 
20, 227-248, 273-287 
automata. Frankel, 20, 149-162 
conditioning. Boguslavsky, 20, 125-138 
cumulative scale analysis. Hays and 
Borgatta, 19, 271-279 
ergographic experiment. Tucker, 20, 51-61 
group vs. individual problem-solving. 
Lorge and Solomon, 20, 139-148 
history. Richardson, 13, 147-174, 197-232 
identifiability of structural variables. 
Reiersøl, 15, 121-149 
intervening variables. Boguslavsky, 20, 
125-138 
latent class. Green, 16, 151-166 
latent structure analysis. Anderson, 19, 
1-10; Gibson, 20, 69-73 
learning (see Learning, models). 
multidimensional scaling. Torgerson, 17, 
401-419 
nervous system. Ashby, 12, 135-140; 
Frankel, 20, 149-162 
psychometric. Tucker, 20, 267-271 
relational analysis. Luce, Macy, and 
Tagiuri, 20, 319-327 
selection of personnel. Sichel, 17, 1-39; 
Arbous and Sichel, 17, 331-346 
speed and length of problem. Birren and 
Botwinick, 16, 219-232 
war-moods. Richardson, 13, 147-174, 
197-232 
Monte Carlo method, digital computers. 
Frankel, 20, 149-162 
Multiple-choice items, reliability. Lord, 17, 
181-194 
Multiple-choice tests, number of alterna- 
tives. Hamilton, 15, 151-168 
Multiple-choice vs. answer-only items. 
Plumlee, 17, 69-86; Plumlee, 19, 65-70 
Multiple correlation, biserial. Wherry, 12, 
189-195 
bounds. Horst, 16, 57-66 
differences. Mollenkopf, 15, 409-417 
estimation. Gengerelli, 13, 135-146; 
Layton, 16, 51-56 
Multiple regression, computational pro- 
cedures. Van Boven, 12, 127-133 
square root method. Summerfield and 
Lubin, 16, 271-284, 423-437; Harman, 
19, 39-55 

Multiplicative scales. Weitzenhoffer, 16, 
387-406 

Multiserial correlation, general formulas. 
Jaspen, 11, 23-30 

Multivariate analysis. Gulliksen and Wilks, 
15, 91-114; Anderson, 16, 31-50; 
Tyler, 17, 289-296; McGill, 19, 97-116 

Multivariate normal. Birnbaum, Paulson, 
and Andrews, 15, 191-204; Anderson, 
16, 31-50 

Negative binomial, estimation of para- 
meters. Sichel, 16, 107-127 

Nervous system, model. Frankel, 20, 149- 
162 


Nomographs, chi square for frequency of 
differences. Voss, 12, 43-49 
in percentages. Appel, 17, 
325-330 
inter-item correlation and true score. 
Tucker, 11, 1-13 
percentages and sample size. Swineford, 
11, 43-49 
range. Schützenberger, 13, 95-97 
tetrachoric correlation. Hayes, 11, 163- 
172; Hamilton, 13, 259-269 
Non-attempted test items, biases in scoring. 
Hamilton, 15, 151-168 
Non-parametric methods, item analysis. 
Webster, 18, 207-211 
means. Festinger, 11, 97-105 
range. Schützenberger, 13, 95-97 
ranks. Moses, 17, 239-247; Lyerly, 17, 
421-428 
sign test. Voss, 12, 43-49 
Normalizing units. Leverett, 12, 141-152 
Oblique coordinates, model. Gulliksen and 
Tucker, 16, 233-238 
Operating characteristic curves, selection 
procedures. Sichel, 17, 1-39 
Optimal — classification. Dwyer, 19, 
11-26 
Order-relations, simplex. Guttman, 20, 
173-192 
Ordinal scales. Weitzenhoffer, 16, 387-406 
Organizational analysis, correlation co- 
efficient. Winer, 20, 63-68 
P-technique (see Factor analysis, P- 
technique). 
Paired comparisons. Comrey, 15, 317-325 
case V. Mosteller, 16, 3-9; Mosteller, 16, 
207-218 
— Edwards and Thurstone, 17, 169- 
180 
scaling. Burros and Gibson, 19, 57-64; 
Coombs, 19, 183-194 
theory. Coombs, 13, 59-68 
unequal variances. Mosteller, 16, 203-206 
Panel selection. Harrison, Ishler, and Laue, 
19, 79-88 
Parabolic correlation. Peters, 11, 57-69 


curves, fitting. Chapanis, 18, 327- 
336 
Parallel tests, criterion. Lord, 20, 1-22 
Votaw’s test. Green, 15, 251-257 

Partitions, sociometric paths. Ross and 
Harary, 17, 195-208 

Pattern analysis, common elements. Genger- 
elli, 13, 69-77 

Pattern similarity, index. Cattell, 14, 279- 
298 


Q-technique. Sakoda, 19, 253-256 
Percentages, correlated. Swineford, 13, 23- 
25 


sample size. Swineford, 11, 43-49 
testing hypotheses. McNemar, 12, 153- 
157; Edwards, 13, 185-187; Swineford, 
14, 183-187; Appel, 17, 325-330 
Performance parameters, effect of age. 
Birren and Botwinick, 16, 219-232 
Persistence test, validity. French, 13, 271- 
277 


Personality tests, forced-choice items. Brog- 
den, 19, 141-148 
objective. Cattell, Dubin, and Saunders, 
19, 209-230 
Personnel classification (see Classification). 
Personnel selection. Sichel, 17, 1-39; Arbous 
and Sichel, 17, 331-346 
Phi coefficient, corrections. Guilford and 
Perry, 16, 335-346 
tables. Jurgensen, 12, 17-29 
Placement, item. Mollenkopf, 15, 291-315 
Point biserial correlation, computation. 
Dingman, 19, 257-259 
fiducial limits. Perry and Michael, 19, 
313-325 
Poisson distribution, compound. Sichel, 16, 
107-127 
Poisson variable, transformation. Bruner, 
Postman, and Mosteller, 15, 63-72 
Politics, — Richardson, 13, 147-174, 
197-232 


Pooling rules, analysis of variance. Binder, 
20, 29-50 


Power functions, true scores. Lord, 18, 57- 
76 


Prediction, addition of single test to battery. 
Horst, 16, 57-66 
differential (see Classification). 
distribution of scores. Ferguson, 14, 61-68 
efficiency. Brogden, 14, 169-182 
optimal test length. Horst, 14, 79-88; 
Horst, 16, 189-202 
test length. Horst, 15, 407-408 
testing time. Taylor, 15, 391-406 
Principal axes, profile analysis. Harris, 20, 
289-297 
— determining weights. Green, 15, 
57-61 
Probability, conditioning model. Boguslav- 
sky, 20, 125-138 
diadic relations. Luce, Macy, and Tagiuri, 
20, 319-327 
group problem-solving model. Lorge and 
Solomon, 20, 139-148 
Probability (cont.) 
learning model. Miller and McGill, 17, 
369-396; Gulliksen, 18, 297-307 
Markov chains. Miller, 17, 149-167; Kao, 
18, 241-243; Goodman, 18, 245-248 
psychological. Coombs, 19, 183-194 
Probit analysis, biserial correlation. Maritz, 
18, 97-110 
Problem-solving, group vs. individual. 
Lorge and Solomon, 20, 139-148 
Profile analysis. Harris, 20, 289-297 
classification. Dwyer, 19, 11-26 
job clusters. Thorndike, 18, 267-276 
Q-factor analysis. Sakoda, 19, 253-256 
Profile matching. Cattell, 14, 279-298 
Proportions (see Percentages). 
Psychometric Society, constitution. 14, 
251-256 
membership list. 17, 447-456 
relation to other statistical groups. 15, 
205-207 


Psychometricians, general objectives. 
Adkins, 15, 331-338 
Psychometrika, rules for preparation of 
manuscripts. 18, 265 (c) 
Psychophysical curves, speed and length 
of problem. Birren and Botwinick, 16, 
219-232 
Psychophysical scaling, multidimensional. 
Torgerson, 17, 401-419 
Psychophysics (see also Scaling). 
panel selection. Harrison, Ishler, and 
Laue, 19, 79-88 
successive intervals. Gulliksen, 19, 117- 
139 
test theory. Perry, 18, 319-325 
Q-technique (see Factor analysis, Q- 
technique). 


Qualitative variables, distance between 
classes. Wherry and Taylor, 11, 155-161 
information transmission. McGill, 19, 
97-116 
multiserial eta. Wherry and Taylor, 11, 
155-161 
regression. Wherry, 12, 189-195 
reliability. Guttman, 11, 81-95 
Quality control, scoring tests. Kimball, 15, 
1-15 
selection. Sichel, 17, 1-39 
Rank, matrix. Guttman, 19, 149-161 
theorems. Reiersøl, 15, 121-149 
Rank correlation, average. Lyerly, 17, 421- 
428 


Rank-order methods, testing means. Festin- 
ger, 11, 97-105 
Ratings, reliability (see Reliability, ratings). 
status index. Katz, 18, 39-43 
Ratio scaling. Comrey, 15, 317-325 
Recall, model. Miller and McGill, 17, 369- 
396 
Rectification, parabolas. Chapanis, 18, 327- 
336 
Redundant chains, sociometric analysis. 
Luce and Perry, 14, 95-116; Ross and 
Harary, 17, 195-208 
Regression, analysis of variance. Bruner, 
Postman, and Mosteller, 15, 63-72 
curvilinear. Peters, 11, 57-69; Lord, 17, 
181-194 
criterion. Wherry, 12, 189- 
195 
factor analysis. Carter and Dudek, 12, 
31-42 
hypothesis testing. Gulliksen and 
Wilks, 15, 91-114 
image theory. Guttman, 18, 277-296 
prediction of differences. Mollenkopf, 15, 
409-417 
psychophysics. Perry, 18, 319-326 
real scores on raw scores. Hamilton, 15, 
151-168; Lyerly, 16, 21-30 
tests for homogeneity. Gulliksen and 
Wilks, 15, 91-114 
Regression weights, computation. Genger- 
elli, 13, 135-146 
length of test. Horst, 13, 125-134 
Relational analysis, model. Luce, Macy, and 
Tagiuri, 20, 319-327 
Reliability, analysis of variance. Alexander, 
12, 79-99; Horst, 14, 21-31; Cronbach 
and Warrington, 16, 167-188; Cron- 
bach, 16, 297-334; Ebel, 16, 407-424; 
Cronbach and Warrington, 17, 127-147 
bounds. Cronbach and Warrington, 16, 
167-188; Guttman, 18, 225-239; Gutt- 
man, 20, 113-124 
chance success. Plumlee, 17, 69-86 
comparison of methods. Gulliksen, 15, 
259-269; Cronbach, 16, 297-334 
computation of maximizing weights. 
Green, 15, 57-61 
constancy of standard error of measure- 
ment. Mollenkopf, 14, 189-229 
correction for non-attempts. Gulliksen, 
15, 259-269; Cronbach and Warring- 
ton, 16, 167-188 
correction for number of raters. Cureton, 
17, 397-399 
correction for speeded tests. Guttman, 
20, 113-124 
critique. Guttman, 18, 123-130; Gullik- 
sen, 18, 131-133 
definitions. Cronbach, 12, 1-16 
difficulty. Brogden, 11, 197-214; Tucker, 
14, 117-119; Plumlee, 17, 69-86; 
Horst, 19, 291-296 
equivalence. Johnson, 15, 115-119; Cron- 
bach, 16, 297-334 
equivalent items. Tucker, 11, 1-13 
experimental dependence. Guttman, 18, 
225-239 
factor analysis. Cronbach, 12, 1-16; 
Guilford and Michael, 15, 237-249 
forced choice items. Fowler, 12, 221-232 
generalized. Horst, 14, 21-31; Guttman, 
20, 113-124 
item difficulty. Cronbach and Warring- 
ton, 17, 127-147; Lord, 17, 181-194 
item precision. Cronbach and Warring- 
ton, 17, 127-147 


length of test. Horst, 13, 125-134 
maximizing by item selection. Bedell, 15, 
419-430 
modification of KR-20. Tucker, 14, 117- 
119 
navigator performance. Carter and 
Dudek, 12, 31-42 
qualitative data. Guttman, 11, 81-95 
ratings. Horst, 14, 21-31; Ebel, 16, 407- 
424; Cureton, 17, 397-399 
relation to validity. Horst, 14, 79-88 
sampling. Lord, 20, 1-22 
sociometric measures. Katz and Powell, 
18, 249-256 
sources of error variance. Cronbach, 12, 
1-16 
speeded tests. Gulliksen, 15, 259-269; 
Cronbach and Warrington, 16, 167- 
188; Angoff, 18, 1-14; Guttman, 18, 
225-239; Guttman, 20, 113-124 
split-half. Cronbach, 16, 297-334 
standard length. Woodbury, 16, 103-106 
test for equality. Green, 15, 251-257 
theory. Guttman, 18, 225-239; Guttman, 
20, 113-124 
validity. Horst, 16, 189-202 
variation across ranges of scores. Mollen- 
kopf, 14, 189-229 
Rest-work curve. Tucker, 20, 51-61 
Restriction in range, estimation of para- 
meters. Votaw, Rafferty, and Deemer, 
15, 339-347 
Reward vs. punishment, learning theory. 
Gulliksen, 18, 297-307 
Rotation (see Factor analysis, rotation). 
Sample size, percentages. Swineford, 11, 43- 
49; Swineford, 14, 183-187 
Sampling, items vs. examinees. Lord, 20, 
1-22 
sequential. Kimball, 15, 1-15 
Saturation coefficient. Loevinger, Gleser, 
and DuBois, 18, 309-317 
Scaling, critique. Guttman, 18, 123-130; 
Gulliksen, 18, 131-133 
discriminal dispersion. Rimoldi and Hor- 
maeche, 20, 307-318; Burros, 20, 299- 
305 
factor analysis. Wherry and Winer, 18, 
161-179 
Guttman scale analysis. McCarthy, 16, 
247-269 
incomplete data. Gulliksen, 19, 117-139 
internal consistency check. Edwards and 
Thurstone, 17, 169-180 
interstimulus similarity. Coombs, 19, 
183-194 
latent distance analysis. Hays and 
Borgatta, 19, 271-279 
least squares. Mosteller, 16, 3-9 
multidimensional. Torgerson, 17, 401- 
419 
paired comparisons. Mosteller, 16, 3-9; 
Mosteller, 16, 203-206; Mosteller, 16, 
207-218; Gibson, 18, 15-21; Burros 
and Gibson, 19, 57-64 


panel selection. Harrison, Ishler, and 
Laue, 19, 79-88 
ranks. Cureton, 17, 397-399 
ratio. Comrey, 15, 317-325 
simplex. Guttman, 20, 173-192 
successive intervals. Edwards and Thur- 
stone, 17, 169-180; Gulliksen, 19, 117- 
139; Rimoldi and Hormaeche, 20, 307- 
318; Burros, 20, 299-305 
test construction. Cronbach, 16, 297-334 
tests of goodness of fit. Mosteller, 16, 
207-218 
theory. Coombs, 13, 59-68; Edwards and 
Kilpatrick, 13, 99-114; Weitzenhoffer, 
16, 387-406; Edwards and Thurstone, 
17, 169-180 
unfolding technique. Coombs, 19, 183-194 
Scalogram board, substitute. Gibson, 18, 
111-113 
Scatter diagram, oblique coordinates. 
Gulliksen and Tucker, 16, 233-238 
Scholastic achievement, prediction. French, 
13, 271-277 
Scoring, personality tests. Brogden, 19, 
141-148 


Scoring formulas, wrongs score. Fruchter, 
18, 257-265 

Scoring keys, multiple-choice items. French, 
17, 101-106 

Scoring tests, biases. Hamilton, 15, 151-168 
checking accuracy. Kimball, 15, 1-15 
correction for guessing. Hamilton, 15, 

151-168; Lyerly, 16, 21-30 
factor content. Fruchter, 18, 257-265 

Second-order factors (see Factor analysis, 

second-order). 

Selection (see also Classification). 
classification. Votaw, 17, 255-266 
efficiency. Brogden, 14, 169-182 
items. Gleser and DuBois, 16, 129-139; 

Webster, 18, 207-211 
multidimensional populations. Birnbaum, 
Paulson, and Andrews, 15, 191-204 
panel of judges. Harrison, Ishler, and 
Laue, 19, 79-88 
personnel. Thorndike, 15, 215-235; Sichel, 
17, 1-39; Arbous and Sichel, 17, 331- 
346 
population. Votaw, Rafferty, and Deemer, 
15, 339-347 
test. Summerfield and Lubin, 16, 271- 
284, 423-437 
test length. Taylor, 15, 391-406; Horst, 
15, 407-408 
tests adjusted for time. Long and Burr, 
14, 137-161 
unifactor items. Bedell, 15, 419-430 
Selection on population, item parameters. 
Gulliksen, 16, 285-296 
Selection ratio, optimum difficulty. Cron- 
bach and Warrington, 17, 127-147 
Selective efficiency, personnel. Brogden, 
11, 139-154; Brogden, 19, 205-208 
pre-screening. Arbous and Sichel, 17, 
331-346 
Selective efficiency (cont.) 
test battery. Sichel, 17, 1-39 
Self-organizing systems. Ashby, 12, 135-140 
Semi-partial correlation, square root 
method. Summerfield and Lubin, 16, 
271-284, 423-437 
Sequential analysis, scoring tests. Kimball, 
15, 1-15 
Serial correlation, interrelationships. Wherry 
and Taylor, 11, 155-161 
multiserial eta. Wherry and Taylor, 11, 
155-161 
psychophysics. Perry, 18, 319-325 
triserial and higher. Jaspen, 11, 23-30 
Sign test, paired differences. Voss, 12, 43-49 
Simple structure (see Factor analysis, 
rotation). 
Simplex, factor analysis. Guttman, 20, 173- 
192 


Simplex method, classification. Dwyer, 19, 
11-26 
linear programming. Votaw, 17, 255-266 
Skewed distributions, negative binomial. 
Sichel, 16, 107-127 
Skewness, binomial. Gengerelli, 13, 69-77 
influence on standard error of measure- 
ment. Mollenkopf, 14, 189-229 
tests. Taylor, 12, 111-125 
Sociometric analysis. Luce, 15, 169-191 
diadic relations. Luce, Macy, and Tagiuri, 
20, 319-327 
matrix methods. Luce and Perry, 14, 
95-116 
overlapping groups. Winer, 20, 63-68 
redundant paths. Ross and Harary, 17, 
195-208 
reliability. Katz and Powell, 18, 249-256 
status index. Katz, 18, 39-43 
Speed of addition, length of problem. 
Birren and Botwinick, 16, 219-232 
Speed vs. power, reliability. Guttman, 20, 
113-124 
Speeded tests, reliability (see Reliability, 
speeded tests). 
scoring. Fruchter, 18, 257-265 
Split-half reliability, critique. Cronbach, 16, 
297-334 
Square root method. Summerfield and 
Lubin, 16, 271-284, 423-437; Harman, 
19, 39-55 
Stability coefficient. Cronbach, 12, 1-16 
Standard errors, discriminant statistic Ps. 
Betts, 15, 435-439 
test statistics. Lord, 20, 1-22 
Standard error of measurement, influence 
of skewness and kurtosis. Mollenkopf, 
14, 189-229 
test for equality. Green, 15, 251-257 
variation over test-score range. Mollen- 
kopf, 14, 189-229 
Standard test length. Woodbury, 16, 103- 
106 


Status index. Katz, 18, 39-43 
Status score, attitude scaling. Coombs, 13, 
59-68 


Stimulus scales, construction. Coombs, 19, 
183-194 
Stochastic models, learning model. Miller 
and McGill, 17, 369-396 
Markov chains. Miller, 17, 149-167; Kao, 
18, 241-243; Goodman, 18, 245-248 
nervous system. Frankel, 20, 149-162 
Structural variables, identifiability. 
Reiersøl, 15, 121-149 
Successive intervals, internal consistency. 
Edwards and Thurstone, 17, 169-180 
scaling. Gulliksen, 19, 117-139 
Successive residuals. Horst and Smith, 15, 
271-289; Layton, 16, 51-56 
Sufficient statistics, true scores. Lord, 18, 
57-76 
Summated ratings, scaling. Edwards and 
Kilpatrick, 13, 99-114 
Suppressor variables, factor analysis. Guil- 
ford and Michael, 13, 1-22 
personality tests. Brogden, 19, 141-148 
test selection. Wherry, 11, 239-247 
Tables, chi square for two degrees of 
freedom. Gordon, Loveland, and Cure- 
ton, 17, 311-316 
correlated percentages. Swineford, 13, 
23-25 
high-low-27-per cent. Fan, 19, 231-237 
item discrimination. Fan, 19, 231-237 
mean of truncated normal deviates. 
Leverett, 12, 141-152 
negative binomial. Sichel, 16, 107-127 
percentages and sample size. Swineford, 
11, 43-49; Swineford, 14, 183-187 
phi coefficient. Jurgensen, 12, 17-29 
point biserial correlation. Perry and 
Michael, 19, 313-325 
rank tests on means. Festinger, 11, 97-105 
skewness (significance levels). Taylor, 12, 
111-125 
tests for ranks. Moses, 17, 239-247 
tetrachoric correlation. Davidoff and 
Goheen, 18, 115-121; Davidoff, 19, 
163-164; Hamilton, 13, 259-269; Welsh, 
20, 83-85; Jenkins, 20, 253-258 
Test construction, difficulty and validity. 
Cronbach and Warrington, 17, 127-147 
factor analysis. Wherry and Winer, 18, 
161-179 
forced choice items. Fowler, 12, 221-232 
homogeneous tests. Loevinger, Gleser, 
and DuBois, 18, 309-317 
item difficulty. Lord, 17, 181-194 
item homogeneity. Cronbach, 16, 297-334 
iterative factor analysis. Wherry, Camp- 
bell, and Perloff, 16, 67-74 
length. Horst, 13, 125-134; Horst, 14, 
79-88; Horst, 16, 189-202 
maximum validity. Gleser and DuBois, 
16, 129-139; Webster, 18, 207-211 
multiple-choice items. Plumlee, 17, 69- 
86; Plumlee, 19, 65-70 
persistence test. French, 13, 271-277 
personality questionnaires. Brogden, 19, 
141-148 
scaling items. Edwards and Kilpatrick, 
13, 99-114 
univocal factor scores. Guilford and 
Michael, 13, 1-22 
Test selection. Horst and Smith, 15, 271- 
289 


computational procedures. Layton, 16, 
51-56 

integral weights. Wherry and Gaylord, 
11, 173-183 

square root method. Summerfield and 
Lubin, 16, 271-284, 423-437 

suppressor variables. Wherry, 11, 239-247 

weights. Gengerelli, 13, 135-146 

Wherry-Doolittle adjusted for time. Long 
and Burr, 14, 137-161 

Test theory, correction for guessing. Hamil- 
ton, 15, 151-168; Lyerly, 16, 21-30 

critique. Guttman, 18, 123-130; Gullik- 
sen, 18, 131-133 

differential validity. Cronbach and War- 
rington, 17, 127-147 

difficulty related to reliability and 
validity. Brogden, 11, 197-214 

discrimination. Ferguson, 14, 61-68 

equivalent items. Tucker, 11, 1-13 

equating tests. Lord, 20, 193-200 

error of measurement. Green, 15, 251-257 

factor structure. Guilford and Michael, 
15, 237-249 

forced-choice items. Brogden, 19, 141-148 

information. Cronbach, 19, 263-270 

item difficulty. Lord, 17, 181-194 

item parameters. Gulliksen, 16, 285-296 

item placement. Mollenkopf, 15, 291-315 

item selection. Wherry and Gaylord, 11, 
173-183 

optimal length. Horst, 13, 125-134; 
Horst, 14, 79-88; Taylor, 15, 391-406; 
Horst, 15, 407-408; Horst, 16, 189-202 

psychophysics. Perry, 18, 319-326 

qualitative items. Guttman, 11, 81-95 

quantification of measurements. Cureton, 
11, 191-196 

reliability. Guttman, 11, 81-95; Cron- 
bach, 12, 1-16; Cronbach, 16, 297-334; 
Guttman, 18, 225-239 

reliability and validity. Tucker, 11, 1-13 

sampling items vs. examinees. Lord, 20, 
1-22 

scoring. Fruchter, 18, 257-265 

— populations. Gulliksen, 16, 285- 
296 


speed and power. Gulliksen, 15, 259-269; 
Mollenkopf, 15, 291-315; Cronbach 
and Warrington, 16, 167-188; Angoff, 
18, 1-14 

standard errors. Lord, 20, 1-22 

standard length. Woodbury, 16, 103-106 

test selection. Wherry, 11, 239-247 

true scores. Lord, 18, 57-76 

validity. Horst, 16, 57-66; Plumlee, 19, 
65-70 

variation in standard error of measure- 
ment. Mollenkopf, 14, 189-229 


Tetrachoric correlation, computation. 
Hamilton, 13, 259-269; Davidoff and 
Goheen, 18, 115-121; Davidoff, 19, 
163-164; Welsh, 20, 83-85; Jenkins, 
20, 253-258 

computing diagrams. Hayes, 11, 163-172 

estimation from phi coefficient. Guilford 
and Perry, 16, 335-346 

nomograph. Hamilton, 13, 259-269 

worksheet. Goheen and Kavruck, 13, 
279-280 

Thurstone learning function, generalization. 
Gulliksen, 18, 297-307 

Time series, correlation. Payne and Staugas, 
20, 87-92 

Toops Z-method. Adkins, 14, 69-73 

Training methods, navigation. Carter and 
Dudek, 12, 31-42 

Transformations (see also Factor analysis, 
rotation). 

orthogonal to oblique coordinates. Gullik- 
sen and Tucker, 16, 233-238 

parabolic curves. Chapanis, 18, 327-336 

profile analysis. Harris, 20, 289-297 

square root. Bruner, Postman, and 
Mosteller, 15, 63-72 

Transitional probabilities, stochastic model. 
Miller, 17, 149-167; Kao, 18, 241-243; 
Goodman, 18, 245-248 

— information. McGill, 19, 97- 
116 


Trends, comparisons. Lindquist, 12, 65-78 
reliability. Alexander, 12, 79-99 
significance. Lindquist, 12, 65-78 

Triangular matrix, inverse. Fruchter, 14, 

True ability, metric. Lord, 18, 57-76 

True scores, estimation. Lord, 18, 57-76 
standard error. Lord, 20, 1-22 

Truncated distributions, estimation of 
correlation. Weichelt, 11, 215-221 
estimation of parameters. Gulliksen, 16, 
285-296 
Truncated multivariate normal. Birnbaum, 
Paulson, and Andrews, 15, 191-204; 
Votaw, Rafferty, and Deemer, 15, 339- 
347 


Truncated normal, biserial correlation. 
Maritz, 18, 97-110 

Truncated normal curve, mean of deviates. 
Leverett, 12, 141-152 

Truncated normal distribution. Birnbaum, 
15, 385-389 

Unfolding technique, scaling. Coombs, 19, 
183-194 

Unifactor tests. Bedell, 15, 419-430 

Univocal factor scores. Guilford and 
Michael, 13, 1-22 

Univocal tests, construction. Wherry and 
Winer, 18, 161-179; Loevinger, Gleser, 
and DuBois, 18, 309-317 

scoring formulas. Fruchter, 18, 257-265 

Unweighted means analysis, analysis of 
variance and covariance. Tsao, 11, 107- 
128 
Utility, test. Cronbach, 19, 263-270 
Utility scales, construction. Coombs, 19, 
183-194 
Validity, correction for chance success. 
Johnson, 12, 51-58 
difficulty. Brogden, 11, 197-214 
distribution of scores. Ferguson, 14, 61-68 
effect of cutting score. Birnbaum, 15, 
385-389 


item. Gulliksen, 16, 285-296 

item difficulty. Johnson, 12, 51-58 

item selection. Gleser and DuBois, 16, 
129-139; Webster, 18, 207-211 

length. Cronbach, 19, 263-270 

maximum. Horst, 16, 57-66 

multiple-choice vs. answer-only items. 
Plumlee, 19, 65-70 

optimum difficulty. Cronbach and War- 
rington, 17, 127-147; Lord, 17, 181-194 

persistence test. French, 13, 271-277 

prediction. Horst, 16, 57-66 

reliability. Tucker, 11, 1-13 

speeded tests. Myers, 17, 347-352 

test length. Horst, 13, 125-134; Horst, 
14, 79-88; Long and Burr, 14, 137-161; 
Taylor, 15, 391-406; Horst, 15, 407- 
408; Horst, 16, 189-202 

USAFI tests. Findley and Andregg, 14, 
47-60 

Variance analysis (see Analysis of variance). 
Variance components, error term. Binder, 
20, 29-50 
Vector spaces. Reiersøl, 15, 121-149 

Votaw’s test. Green, 15, 251-257 

War, psychology. Richardson, 13, 147-174, 
197-232 


Weighting, suppressing factors. Guilford 
and Michael, 13, 1-22 
univocal factor scores. Guilford and 
Michael, 13, 1-22 
Weights, discriminant function. Wherry, 
12, 189-195 
length of test. Horst, 13, 125-134 
optimum classification. Thorndike, 15, 
215-235 
test selection. Wherry and Gaylord, 11, 
173-183 
Wherry-Doolittle test selection procedure, 
integral weights. Wherry and Gaylord, 
11, 173-183 
suppressor variables. Wherry, 11, 239-247 
time adjustments. Long and Burr, 14, 
137-161 
Wherry-Gaylord iterative factor analysis. 
Wherry, Campbell, and Perloff, 16, 
67-74; Wherry and Winer, 18, 161-179 
Wilks’ criterion, equality of standard error 
of measurement. Green, 15, 251-257 
Wilks’ criterion for homogeneity of inter- 
correlation matrices. Gulliksen and 
Wilks, 15, 91-114 
Wilks-Votaw criterion parallel tests. Lord, 
20, 1-22 